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POLYPEPTIDES WITH PROTEIN DISULFIDE REDUCING PROPERTIES 



FIELD OF THE INVENTION 

The present invention relates to polypeptides capable of reducing disulfide 
5 bonds, including variants of protein disulfide isomerase. The invention also relates to 
nucleotide sequences encoding the polypeptides, as well as nucleic acid constructs, 
vectors, and host cells comprising the nucleotide sequences. Further, the invention 
relates to methods for producing and using the polypeptides, including use of the 
polypeptides for reducing the allergenicity of allergenic proteins. 

10 

BACKGROUND OF THE INVENTION 

Many of the proteins responsible for allergic reactions have intramolecular disul- 
fide bonds. Disulfide bonds in proteins are formed between cysteine residues and 
have the function of stabilising the secondary and tertiary structure of the protein. By 

15 reducing these bonds it has been shown that it is possible to change the allergic 
properties of a given protein. 

Among the protein disulfide redox enzymes are two main groups of enzymes, 
thioredoxin (TRX) and protein disulfide isomerase (PDI). The redox enzymes cata- 
lyse the general reaction: 

20 Ri-SH + R 2 -SH + Enzox <-> RrS-S-R 2 + Enz re d 

where Ri and R 2 represent protein entities which are the same or different, either 
within the same polypeptide or in two polypeptides, Enzox is a protein disulfide redox 
enzyme in the oxidised state and Enz re d is a protein disulfide redox enzyme in the 
reduced state. 

25 EC 5.3.4.1 (Enzyme Nomenclature, Academic Press, Inc. 1992), Protein disul- 

fide isomerase (alternative named S-S rearrangase) is an enzyme capable of catalys- 
ing the rearrangement of both intrachain and interchain -S-S- bonds in proteins. In 
order words, protein disulfide isomerase catalyzes the oxidation, reduction and isom- 
erization of protein disulfides. Most of the enzymes capable of reducing disulfide 

30 bonds have the following amino acid sequence in common: 
R 1 - Cys - Xi - Yi - Cys - R 2 (Formula 1 ) 
wherein R 1 and R 2 each are different amino acid sequences. For protein disulfide 
isomerase (PDI) generally Xi is Gly, and Yi is His, and for thioredoxin (TRX) gener- 
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ally Xi is Gly, and Yi is Pro, using the conventional nomenclature for amino acid 
residues. In nature, PDI consists of two subunits, each consisting of two homologous 
domains, where each domain comprises Formula 1 depicted above, whereas TRX 
consists of one domain only. The crystal structure of a protein disulfide isomerase 

5 has been reported in "Crystal structure of the protein disulfide bond isomerase, 
DsbC, from Escherichia coli, McCarthy A. A. et al., 2000, Nat Struct Biol, 7(3):196-9". 

It has been shown that mitigation of food allergy is achieved by the reduction of 
disulfide bonds using thioredoxin (TRX), e.g. US 5,792,506 or B.B. Buchanan et al. 
(Proc. Natl. Acad. Sci, USA, Vol. 94, pp. 5372-5377, 1997) and WO 96/12799 de- 

10 scribing experiments performed on sensitised dogs fed with TRX treated food, show- 
ing TRX as a potent reductant of allergenic proteins. However, the use of thioredoxin 
on an industrial scale for the production of low allergy food remains to be exploited. 
This is due to the fact that it has not yet been possible to achieve extracellularly ex- 
pressed disulfide reducing proteins in quantities large enough for industrial applica- 

15 tion. 

In nature, protein disulfide isomerase is expressed intracellular^ only, and has 
proven very difficult to express extracellularly due to proteolytic degradation. 

Extracellularly expressed variants have been described previously, for example 
in WO 95/00636 disclosing a fungal protein disulfide isomerase from Aspergillus and 
20 sequences for the recombinant production of the protein, and in WO 95/01425 de- 
scribing compositions comprising protein disulfide redox enzymes, both applications 
describing the use of the protein variants for the treatment or degradation of in par- 
ticular scleroproteins, especially in hair, skin and wool. 

There is a need for providing the combination of a highly disulfide reducing en- 
25 zyme capable of large-scale recombinant production with a high output, which has so 
far not been possible. 

SUMMARY OF THE INVENTION 

The inventor has provided a protein disulfide isomerase variant having in- 
30 creased reducing properties as compared to the wild-type protein and capable of be- 
ing expressed extracellularly in a high yield. 

Accordingly, the invention relates to polypeptides capable of reducing disulfide 
bonds in proteins, which polypeptide comprises or consists of an amino acid se- 
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quence having (i) at least 60% similarity with the amino acid sequence set forth in 
amino acid number 21-281 of SEQ ID NO: 15; or (ii) at least 60% similarity with the 
amino acid sequence set forth in SEQ ID NO: 17; provided that a position in the poly- 
peptide corresponding to amino acid residues numbered 58-61 in SEQ ID NO: 13 is 
5 selected from the group consisting of: i) Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys, iii) 
Cys-Val-Leu-Cys, and iv) Cys-Gly-Tyr-Cys. 

The invention further relates to a polypeptide capable of reducing disulfide 
bonds which polypeptide is a variant of a parent protein disulfide isomerase by hav- 
ing an amino acid sequence differing from that of the parent protein disulfide isom- 

10 erase in a position corresponding to amino acid residues numbered 58-61 in SEQ ID 
NO: 13, which position is Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys, iii) Cys-Val-Leu-Cys, 
or iv) Cys-Gly-Tyr-Cys for the variant polypeptide. 

The present invention also relates to a method for providing a polypeptide ca- 
pable of reducing disulfide bond and to the polypeptides obtainable by such methods, 

15 which polypeptide is a variant of a protein disulfide isomerase, the method compris- 
ing the step of: altering at least one amino acid residue by amending the amino acid 
Xi to Glu and/or the amino acid Yi to Pro in the active site corresponding to -Cys-Xi- 
Yi-Cys- of a parent protein disulfide isomerase; to obtain a variant of the protein di- 
sulfide isomerase comprising Cys-Gly-Pro-Cys. 

20 The invention further relates to a polypeptide, which is a protein disulfide isom- 

erase variant having the formula of: X - A - [Z - B] n - Y, 

wherein (i) M A" is the amino acid sequence: Cys-Gly-Pro-Cys; (ii) "X" is an amino acid 
sequence comprising a sequence corresponding to SEQ ID NO:1 or a functional 
equivalent thereof; (iii) "Y" is an amino acid sequence comprising a sequence corre- 

25 sponding to SEQ ID NO:2 or a functional equivalent thereof; (iv) "Z" is an amino acid 
sequence comprising a sequence corresponding to SEQ ID NO: 11 or a functional 
equivalent thereof; (v) "B" is individually selected from the amino acid sequence Cys- 
Gly-Pro-Cys or Cys-Gly-His-Gly or Cys-Ala-Thr-Cys or Cys-Pro-His-Cys or Cys-Val- 
Leu-Cys or Cys-Gly-Tyr-Cys; and (vi) n = 0 or an integer from 1 to 3. 

30 The present invention also relates to nucleotide sequences encoding the poly- 
peptides of the invention and to nucleic acid constructs, vectors, and host cells com- 
prising the nucleotide sequences as well as methods for producing the polypeptides. 
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Furthermore, the invention relates to compositions comprising the polypeptides 
of the invention, and to the use of the polypeptides, e.g. for reducing the allergenicity 
of an allergenic protein, e.g. in food and feed products and cosmetics, or for increas- 
ing the digestibility of food or feed. The invention further relates to food or feed addi- 
5 tives comprising a polypeptide of the invention. 



DRAWINGS 

Figure 1 shows the construction of the plasmid pCaHj 527. 
Figure 2 depicts the construction of the plasmid pCaHj 548. 
10 Figure 3 shows the structure of A. oryzae PDI. 

Figure 4 shows alignment of amino acid sequences of a PDI from Aspergillus 

oryzae with a PDI from 7. reseii, H. insolens, S. cerevisiae and A. niger. 
Percent Similarity in upper triangle. Percent Divergence in lower triangle. 
Figure 5 shows results from Examples, Comparison of recombinant fungal protein 
15 disulfide isomerase (PDI) and DsbA isomerase. 

Figure 6 shows results from Examples, Optimisation of the 'PDI:substrate' 

ratio. 

Figure 7 shows alignment of 7 PDI gene product, cf. Examples. 



20 

DETAILED DESCRIPTION OF THE INVENTION 



Polypeptides of the invention and methods of providing such polypeptides 

The inventor has provided a protein disulfide isomerase variant (SEQ ID NO: 15, 
25 where amino acid number 21-281 of SEQ ID NO: 15 is the polypeptide without the 
signal peptide) having increased capacity for reducing protein disulfide bond and ca- 
pable of being expressed extracellularly in a high yield. 

Accordingly, within the scope of the invention is a polypeptide capable of reduc- 
ing disulfide bonds, the polypeptide selected from the group of: 
30 (a) a polypeptide consisting of or comprising an amino acid sequence having at 
least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, 
at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 
99%, or 100% identity or similarity with the amino acid sequence of SEQ ID 
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NO: 15 or the amino acid sequence set forth in amino acid number 21-281 of 
SEQ ID NO:15; 

(b) a polypeptide consisting of or comprising an amino acid sequence having at 
least 60% at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, 

5 at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 

99%, or 100% identity or similarity with the amino acid sequence set forth in 
SEQ ID NO:17; 

(c) a polypeptide capable of reducing disulfide bonds, which polypeptide comprises 
or consists of an amino acid sequence encoded by a nucleotide sequence 

10 which hybridizes under low, medium or high stringency conditions with (i) SEQ 
ID NO:14; (ii) SEQ ID NO:16; (iii) a subsequence of (i) or (ii) of at least 100 nu- 
cleotides; or (iv) a complementary strand of (i), (ii) or (iii); and 

(d) a polypeptide capable of reducing disulfide bonds which (i) is encoded by the 
nucleotide sequence contained in plasmid pCaHj548; or (ii) comprises or con- 

15 sist of an amino acid sequence having at least 60%, at least 65%, at least 70%, 

at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 
96%, at least 97%, at least 98%, at least 99%, or 100% identity or similarity with 
the amino acid sequence encoded by the nucleotide sequence contained in 
plasmid pCaHj548; 

20 provided that a position of the polypeptide corresponding to amino acid residues 
numbered 58-61 in SEQ ID NO: 13 is selected from the group consisting of: i) Cys- 
Gly-Pro-Cys; ii) Cys-Ala-Thr-Cys; iii) Cys-Val-Leu-Cys; and iv) Cys-Gly-Tyr-Cys. 

SEQ ID NO: 12 is the nucleotide sequence of a wt. A. oryzae PDI gene where the 
25 coding sequence is nucleotide number 71-444, 502-880, 692-1401 and 1478-1 832 of 
SEQ ID NO:12. The corresponding wt-PDI amino acid sequence is shown in SEQ ID 
NO: 13, where amino acid number 1-20 is the signal peptide. The active sites of the 
wt. A. oryzae PDI (i.e. i.a. amino acid number 58-61 of SEQ ID NO: 13) is coded by 
nucleotides 242-253 and 1385-1396 of SEQ ID NO: 12. 
30 The polypeptide SEQ ID NO: 15 is identical to amino acid number 1 to 281 of 

SEQ ID NO: 13, except that the polypeptide given by SEQ ID NO: 15 has the se- 
quence Cys-Gly-Pro-Cys in the position corresponding to 58-61 of SEQ ID NO: 13, 
i.e. amino acid number 61 is Pro in SEQ ID NO:15 and amino acid number 61 is His 
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in SEQ ID NO: 13. SEQ ID NO: 14 is the nucleotide sequence coding for the amino 
acid sequence given by SEQ ID NO: 15. The amino acid number 1-20 of SEQ ID 
NO:15 is a signal peptide. Amino acid number 21-281 of SEQ ID NO:15 is the ma- 
ture variant PDI of the invention, i.e. without the signal peptide. The term M SEQ ID 

5 NO:15 M as used herein may also denote the amino acid sequence of SEQ ID NO:15 
without the signal sequence, i.e. the amino acid sequence given by amino acid num- 
ber 21-281 of SEQ ID NO:15. 

The polypeptide SEQ ID NO: 17 is identical to amino acid number 21 to 115 of 
SEQ ID NO:13, except that the polypeptide given by SEQ ID NO: 17 has Cys-Gly- 

10 Pro-Cys in the position corresponding to 58-61 in SEQ ID NO: 13. SEQ ID NO:16 is 
the nucleotide sequence coding for the amino acid sequence SEQ ID NO: 17. 

The amino acid sequence identity is the degree of identity between the two se- 
quences indicating a derivation of the first sequence from the second. The amino 

15 acid sequence similarity also takes into account conservative amino acid substitu- 
tions. The degree of identity or similarity may suitably be determined by means of 
computer programs known in the art. Both the degree of identity and similarity takes 
into account possible gaps. 

For purposes of the present invention, the degree of identity or similarity be- 

20 tween two amino acid sequences may be determined by the Clustal method (Higgins, 
1989, CABIOS 5: 151-153) using the LASERGENE™ MEGALIGN™ software 
(DNASTAR, Inc., Madison, Wl) with an identity table and the following multiple align- 
ment parameters: Gap penalty of 10 and gap length penalty of 10. Pairwise align- 
ment parameters were Ktuple=1, gap penalty=3, windows=5, and diagonals=5]. 

25 The degree of identity or similarity may also be determined according to the 
method described in Needleman, S.B. and Wunsch, CD., (1970), Journal of Molecu- 
lar Biology, 48, 443-45, with the following settings for amino acid sequence compari- 
son: GAP creation penalty of 3.0 and GAP extension penalty of 0.1. The determina- 
tion may be done by means of a computer program known such as GAP provided in 

30 the GCG program package (Program Manual for the Wisconsin Package, Version 8, 
August 1994, Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, 
USA 53711). Two given sequences can be aligned according to the method de- 
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scribed in Needieman {supra) using the same parameters. This may be done by 
means of the GAP program {supra). 

The degree of hybridisation may be determined by the method described in J. 
5 Sambrook, E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory 
Manual, 2d edition, Cold Spring Harbor, New York. 

For long probes of at least 100 nucleotides in length, very low to very high 
stringency conditions are defined as prehybridization and hybridization at 42°C in 5X 
SSPE, 0.3% SDS, 200 \xg/m\ sheared and denatured salmon sperm DNA, and either 
10 25% formamide for very low and low stringencies, 35% formamide for medium and 
medium-high stringencies, or 50% formamide for high and very high stringencies, 
following standard Southern blotting procedures. 

For long probes of at least 100 nucleotides in length, the carrier material is fi- 
nally washed three times each for 15 minutes using 2 x SSC, 0.2% SDS preferably at 
15 least at 45°C (very low stringency), more preferably at least at 50°C (low stringency), 
more preferably at least at 55°C (medium stringency), more preferably at least at 
60°C (medium-high stringency), even more preferably at least at 65°C (high strin- 
gency), and most preferably at least at 70°C (very high stringency). 

For short probes which are about 15 nucleotides to about 70 nucleotides in 
20 length, stringency conditions are defined as prehybridization, hybridization, and 
washing post-hybridization at about 5°C to about 10°C below the calculated T m using 
the calculation according to Bolton and McCarthy (1962, Proceedings of the National 
Academy of Sciences USA 48:1390) in 0.9 M NaCI, 0.09 M Tris-HCI pH 7.6, 6 mM 
EDTA, 0.5% NP-40, 1X Denhardt's solution, 1 mM sodium pyrophosphate, 1 mM so- 
25 dium monobasic phosphate, 0.1 mM ATP, and 0.2 mg of yeast RNA per ml following 
standard Southern blotting procedures. 

For short probes, which are about 15 nucleotides to about 70 nucleotides in 
length, the carrier material is washed once in 6X SCC plus 0.1% SDS for 15 minutes 
and twice each for 15 minutes using 6X SSC at 5°C to 10°C below the calculated T m . 

30 

In one preferred embodiment, the polypeptide of the invention has Cys-Gly-Pro- 
Cys in the position corresponding to amino acid residues numbered 58-61 in SEQ ID 
NO: 13. 



WO 00/70064 



PCT/DKOO/00265 



8 

The polypeptide of the invention may comprise SEQ ID NO: 15, the amino acid 
sequence of amino acid number 21-281 of SEQ ID NO: 15, or SEQ ID NO: 17. In a 
further embodiment, the polypeptide of the invention consists of the amino acid se- 
quence selected from the group consisting of: (i) SEQ ID NO:13 with Cys-Gly-Pro- 

5 Cys in amino acid residues numbered 58-61; (ii) SEQ ID NO: 13 without amino acid 
number 1-20 and with Cys-Gly-Pro-Cys in amino acid residues numbered 58-61 in 
SEQ ID NO:13; (iii) SEQ ID NO:15; (iv) amino acid number 21-281 of SEQ ID NO:15; 
and (v) SEQIDNO:17. 

The polypeptide of the invention may also be a protein disulfide isomerase hav- 

10 ing an amino acid sequence which in a position corresponding to amino acid resi- 
dues numbered 58-61 in SEQ ID NO: 13 is Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys; iii) 
Cys-Val-Leu-Cys; or iv) Cys-Gly-Tyr-Cys. In a preferred embodiment the polypeptide 
is a protein disulfide isomerase having an amino acid sequence, which in a position 
corresponding to amino acid residues numbered 58-61 in SEQ ID NO: 13 is Cys-Gly- 

15 Pro-Cys. 

The invention also relates to a polypeptide capable of reducing disulfide bonds, 
which is a variant of the polypeptides as described herein, such as a variant, e.g., of 
SEQ ID NO: 15; a variant of the amino acid sequence set forh in amino acid number 
21-281 of SEQ ID NO: 15; or a variant of SEQ ID NO: 17, the variant comprising sub- 
20 stitution(s), deletion(s), and/or insertion(s) of one or more amino acids; provided that 
the variant comprises an amino acid sequence selected from the group consisting of: 
i) Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys, iii) Cys-Val-Leu-Cys, and iv) Cys-Gly-Tyr- 
Cys, in a position corresponding to amino acid residues numbered 58-61 in SEQ ID 
NO:13. 

25 The polypeptides of the invention may, e.g., further comprise an amino acid se- 
quence which has at least 60%, at least 65%, at least 70%, at least 75%, at least 
80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 
98%, at least 99% or 100% identity or similarity with the amino acid sequence set 
forth in SEQ ID NO:18 (i.e. amino acid number 363 to 464 of SEQ ID NO:13) pro- 

30 vided the position corresponding to amino acid residues numbered 31-34 in SEQ ID 
NO: 18 is selected from the group consisting of: (i) Cys-Gly-Pro-Cys, (ii) Cys-Ala-Thr- 
Cys, (iii) Cys-Val-Leu-Cys, (iv) Cys-Gly-Tyr-Cys, and (v) Cys-Gly-His-Cys. 
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Thus, the polypeptide of the invention may, e.g., have one, two or more do- 
mains, e.g. tree domains, each domain comprising a sequence of the type Cys-Xi- 
YrCys, wherein the polypeptide comprises at least one Cys-Xi-Yi-Cys selected from 
the group consisting of: i) Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys, iii) Cys-Val-Leu-Cys, 
5 and iv) Cys-Gly-Tyr-Cys. In one embodiment the polypeptide has two "Cys-Xi-Yi- 
Cys M regions, both of which are Cys-Gly-Pro-Cys. In other embodiments, the poly- 
peptides of the invention have only one domain comprising the region "Cys-Xi-Yi- 
Cys", which region is Cys-Gly-Pro-Cys. 

It is understood that Cys-Xi-Yi-Cys is in a position of the polypeptide conferring 
10 catalytic activity, i.e. in a position corresponding to the amino acid residues numbered 
58-61 in SEQ ID NO: 13, which position can be determined by the person skilled in 
the art by alignment of the amino acid sequence in question with SEQ ID NO: 13. 

The polypeptide of the invention may be a protein disulfide isomerase (PDI, EC 
5.3.4.1. ). 

15 

The invention also provides a polypeptide capable of reducing disulfide bonds, 
the polypeptide being a variant of a parent protein disulfide isomerase by having an 
amino acid sequence differing from that of the parent protein disulfide isomerase in a 
position corresponding to amino acid residues numbered 58-61 in SEQ ID NO: 13 (i.e. 
20 Cys-Xi-Yi-Cys-), which position for the variant polypeptide is selected from the group 
consisting of i) Cys-Gly-Pro-Cys, ii) Cys-Ala-Thr-Cys, iii) Cys-Val-Leu-Cys, and iv) 
Cys-Gly-Tyr-Cys. 

The amino acid sequence of the polypeptide of the invention may - apart from a 
difference in a position corresponding to amino acid residues numbered 58-61 in 

25 SEQ ID NO: 13 - also differ from the amino acid sequence of the parent protein disul- 
fide isomerase by having further substitution(s), deletion(s), and/or insertion(s) of one 
or more amino acids compared to the parent protein disulfide isomerase, provided 
that the polypeptide comprises the amino acid sequence Cys-Gly-Pro-Cys, Cys-Ala- 
Thr-Cys, Cys-Val-Leu-Cys, or Cys-Gly-Tyr-Cys in a position corresponding to amino 

30 acid residues numbered 58-61 in SEQ ID NO: 13. The amino acid sequence of the 
polypeptide being a variant of a parent protein disulfide isomerase may have at least 
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 96%, at least 97%, at least 98%, at least 99% identity or similarity with 



WO 00/70064 



PCT/DKOO/00265 



10 

the amino acid sequence of the parent protein disulfide isomerase. Also contem- 
plated are methods for providing such polypeptides which are variants of a parent 
protein disulfide isomerase (PDI) by amending the amino acid sequence of the parent 
PDI as described herein. 
5 The parent protein disulfide isomerase may be any PDI enzyme, i.e. an enzyme 
that is classified as a PDI under EC 5.3.4.1. The parent PDI may be a fungal PDI, 
such as, e.g. a filamentous fungal PDI, e.g. native to a strain of Humicola, Fusarium 
or Aspergillus, such e.g. H. insolens, F. solani pisi or A. oryzae. The parent PDI, may 
be derived from T. reseii t H. insolens: S. cerevisiae: or A. niger, the sequences of 

10 which in Figure 4 are compared by alignment to an A. oryzae PDI (SEQ ID NO: 13). 
The parent PDI may be SEQ ID NO: 13, a subsequence of SEQ ID NO: 13 having pro- 
tein disulfide isomerase activity, such as, e.g. amino acid no. 21-115 of SEQ ID 
NO:13, e.g. amino acid no. 1-115 of SEQ ID NO:13, e.g. amino acids no. 1-281 of 
SEQ ID NO:13, e.g. amino acids no. 21-281 of SEQ ID NO:13 f or a subsequence 

15 thereof having protein disulfide isomerase activity. The parent protein disulfide isom- 
erase may, e.g. be encoded by a nucleotide sequence which hybridises under low, 
medium or high stringency conditions with one or more of (i) SEQ ID NO: 12, SEQ ID 
NO:14 or SEQ ID NO:16; (ii) nucleotides 131 to 365 of SEQ ID NO:12; (iii) a subse- 
quence of (i) or (ii) of at least 100 nucleotides; or (iv) a complementary strand of (i), 

20 (ii) or (iii). 

Accordingly, within the scope of the invention is a method for providing a poly- 
peptide capable of reducing disulfide bond, the method comprising the step of: 
(a) altering at least one amino acid residue by amending the amino acid Xi to Glu, 
Ala, Val, or Gly and/or by amending the amino acid Yi to Pro, Thr, Leu, or Tyr in the 

25 active site corresponding to Cys-Xi-Yi-Cys of a parent protein disulfide isomerase; 
to obtain a variant of the parent protein disulfide isomerase comprising Cys-Gly-Pro- 
Cys, Cys-Ala-Thr-Cys, Cys-Val-Leu-Cys, or Cys-Gly-Tyr-Cys, in a position corre- 
sponding to amino acid number 58-61 of SEQ ID NO: 13. 

By the term "and/or" in the context of "altering at least one amino acid residue 

30 by amending the amino acid Xi to Glu and/or the amino acid Yi to Pro in the active 
site corresponding to Cys-Xi-Yi-Cys of a parent protein disulfide isomerase to obtain 
a variant of the parent protein disulfide isomerase comprising Cys-Gly-Pro-Cys in a 
position corresponding to amino acid number 58-61 of SEQ ID NO:13" is understood 
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that depending on the identity of Xi and Yi in the parent PDI, it may be needed to 
amend only one ofXi and Yi (if either Xi= Glu or Yi= Pro in the parent PDI) or it may 
be needed to amend both X 1 and Yi in order to achieve Xi= Glu and Yi= Pro (if Xi is 
not Glu and Yi is not Pro in the parent PDI). The term "amend' 1 includes any method 
5 which may be used to change the identity of the amino acids, such as, but not limited 
to, substitutions by site-directed mutagenesis of a corresponding DNA sequence, 
shuffling, or synthesising the polypeptide with the amino acid sequence of the variant 
by solid or liquid phase synthesis. 

The method may further to step (a) (i.e. outside Cys-Xi-YrCys) comprise the 

10 step (b) of substitution(s), deletion(s), and/or insertion(s) of one or more amino acids 
compared to the parent protein disulfide isomerase. Within the scope of the invention 
are polypeptides capable of reducing disulfide bonds obtainable by such method. 
The variant polypeptide may have an amino acid sequence which has at least 60%, 
at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at 

15 least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity or similarity 
with the amino acid sequence of the parent protein disulfide isomerase. 

The present invention also provides a variant of a parent protein disulfide isom- 
erase having increased disulfide reducing properties when compared to the parent 

20 protein disulfide isomerase (e.g. a wild-type protein), the variant having the formula: 
"X - A - [Z - B] n - Y" as described above. 

When n=0, the formula corresponds to X-A-Y, wherein A is the amino acid se- 
quence: Cys-Gly-Pro-Cys, and X and Y are as defined above. The reducing capabil- 
ity of a protein variant of the formula X-A-Y is increased as compared to wild-type 

25 protein disulfide isomerase. The active site of the variant comprises A, but the activity 
is dependant on a larger part of the rest of the sequences, in particular a part of Y. 
Without being bound by theory this is believed to be due to the presentation of the 
active site, such as the tertiary structure of the variant. 

The amino acid sequence of A (Cys-Gly-Pro-Cys) may also be expressed as C- 

30 G-P-C by the symbols of the one-letter amino acid code. Further to the formula "X" is 
an amino acid sequence positioned prior to the active site A, comprising a sequence 
corresponding to SEQ ID NO: 1, or a functional equivalent thereof. The signal se- 
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quence of X (amino acid 1-20) according to the invention may be replaced by any 
other signal sequence. 

As described above naturally occurring DPI comprises domains with each an 
active site. For several applications it is preferred that the protein variant according to 

5 the invention comprises at least two active sites, corresponding to the formula above, 
wherein n=1. The active sites are separated by a sequence, whereby the positioning 
of the active sites is optimized with respect to presentation. The second active site, 
i.e. B, may be individually selected from the amino acid sequence Cys-Gly-Pro-Cys 
or Cys-Gly-His-Gly or Cys-Ala-Thr-Cys or Cys-Pro-His-Cys or Cys-Val-Leu-Cys or 

10 Cys-Gly-Tyr-Cys. The selection of the specific sequence depends on the modulation 
of the reducing properties of the protein variant as compared to the rearrangement 
properties. 

Additional sequences B may be comprised in the protein variant, each sepa- 
rated from the other(s), as exemplified by the above formula, wherein n=2 and n=3. 
15 Any of the above-mentioned active sites may be present as identical repeats or they 
may be present as sequences different from each other, depending on their use. 

The separating sequence, Z, is according to the invention an amino acid se- 
quence positioned between A and B or B and B, i.e. active sites, comprising a se- 
quence corresponding to SEQ ID NO: 11, or a functional equivalent thereof, wherein 
20 the functionality in particular relates to the presentation of the active site. Accordingly, 
in one embodiment of the [Z-B] alignment may be present in from 1 to 3 repeats, 
wherein B is individually selected from Cys-Gly-Pro-Cys or Cys-Gly-His-Gly or Cys- 
Ala-Thr-Cys or Cys-Pro-His-Cys or Cys-Val-Leu-Cys or Cys-Gly-Tyr-Cys. 

In a preferred aspect of the present invention the [Z-B] alignment is repeated at 
25 least twice. 

In one embodiment the protein variant is extracellularly expressible in large 
quantities. This is a requirement if the industrial application of the present protein 
variant is to be feasible. Accordingly, in a preferred embodiment Y is a truncated ver- 
sion of SEQ ID NO:2, thereby increasing the extracellular yield of the protein variant 
30 when recombinantly expressed and truncated versions are encompassed by the pre- 
sent invention as functional equivalents of the sequence. Preferably, Y is an amino 
acid sequence comprising a sequence corresponding to SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ 
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ID NO:9 or SEQ ID NO: 10 or a functional equivalent thereof. In a more preferred em- 
bodiment of the invention Y comprises a sequence corresponding to SEQ ID NO:2. 

In the context of the invention the term a functional equivalent relates to a se- 
quence processing a corresponding property as the sequences mentioned in the 

5 present invention, but wherein one or more amino acids have been substituted with 
others. Preferably a functional equivalent contains conservative substitutions, i.e. 
where one or more amino acids are substituted by an amino acid having similar 
properties, such that a person skilled in the art of protein chemistry will expect the 
secondary and tertiary structure of the protein to be unchanged. Amino acids suitable 

10 for conservative substitutions include those having functionally similar side chains. 
For example, hydrophobic residues: e.g. glycine, alanine, valine, leucine, isoleucine 
and methionine may replace another such residue. Similarly, conservative substitu- 
tions may involve interchanging hydrophilic residues: (e.g.: arginine and lysine, 
glutamine and aspargine, threonine and serine), basic reduces (e.g., lysine, arginine 

15 and histidine), and/or acidic residues (e.g., aspartic acid and glutamic acid). Func- . 
tional equivalents may also, or alternatively, be modified by for example the deletion 
or addition of amino acids, or the chemical modification of amino acids, as long as 
the function of the protein is preserved. Furthermore a functional equivalent accord- 
ing to the invention may additionally relate to any truncated sequence having proper- 

20 ties identical to the sequences of the invention. 

As discussed above it is an object of the present invention to provide a protein 
variant having an increased reducing activity as compared to a parent PDI, e.g. a 
wild-type PDL The reducing activity, i.e. property, may e.g. be more than 25%, 50%, 
75%, 100% 150%, 200% or 250% of the parent protein (e.g. wild-type). The specific 

25 activity may be increased compared to the parent protein. The specific activity ac- 
cording to the present invention may be defined as activity in insulin reduction units 
per mg. of protein and may be estimated using the insulin reduction assay described 
by Bardwell, J. C. A et al. (Cell 67, pp. 581-589, 1991). 

30 The invention also relates to a fusion polypeptide comprising a polypeptide of 

the invention and a fusion partner. The fusion protein may e.g. comprise a polypep- 
tide as defined by the invention, and another protein fragment, wherein said other 
protein fragment is capable of facilitating expression and purification of the polypep- 
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tide, optionally by reducing the susceptibility of the protein variant to enzymatic deg- 
radation. The fusion partner may be a signal peptide, such as, e.g., amino acid num- 
ber 1-20 of SEQ IDNO:13. 

5 The term polypeptide means a polymer of amino acids and may also be termed 

protein. The polypeptide may consist of a single polypeptide chain (monomeric) or 
comprise several associated polypeptides (multimeric, e.g. dimeric). In further em- 
bodiments of the invention, the polypeptide of the invention consist of at most 600 
amino acids, such as at most 400 amino acids, such as, at most 350, at most 330, at 
10 most 300, at most 285, at most 281 amino acids or at most 115 amino acids. 

Nucleotide sequences encoding the polypeptides of the invention 

The present invention also relates to isolated nucleotide sequences which en- 
code a polypeptide of the present invention. In a preferred embodiment, the nucleo- 
15 tide sequence is SEQ ID NO: 14 or SEQ ID NO: 16. The present invention also en- 
compasses a nucleotide sequence which encode a polypeptide having the amino 
acid sequence SEQ ID NO: 1 5, SEQ ID NO: 1 7 or amino acid number 21 -281 of SEQ 
ID NO:15. 

The nucleotide sequence of the parent PDI may be isolated or cloned as known 
20 in the art and include isolation from genomic DNA, preparation from cDNA, or a com- 
bination thereof. Thus, the invention relates to a nucleic acid comprising a nucleotide 
sequence encoding the polypeptide of the invention, the nucleotide sequence may 
optionally be linked to one or more control sequences that direct the production of the 
polypeptide in a suitable expression host. The cloning of the nucleotide sequence 
25 coding for the parent PDI can be effected, e.g., by using the well known polymerase 
chain reaction (PCR) or antibody screening of expression libraries to detect cloned 
DNA fragments with shared structural features. See, e.g., Innis et a/., 1990, PCR: A 
Guide to Methods and Application, Academic Press, New York. Other nucleic acid 
amplification procedures such as ligase chain reaction (LCR), ligated activated tran- 
30 scription (LAT) and nucleotide sequence-based amplification (NASBA) may be used. 

The nucleotide sequence encoding a parent PDI may be isolated from any 
cell or microorganism producing the PDI in question, using various methods well 
known in the art. First, a genomic DNA and/or cDNA library may be constructed using 
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chromosomal DNA or mRNA from the organism that produces the PDI. Then, if the 
amino acid sequence of the PDI is known, labelled oligonucleotide probes may be 
synthesized and used to identify PDI-encoding clones from a genomic library pre- 
pared from the organism in question. Alternatively, a labelled oligonucleotide probe 

5 containing sequences homologous to another known PDI-gene could be used as a 
probe to identify PDI-encoding clones, using hybridization and washing conditions of 
lower stringency. Yet another method for identifying PDI-encoding clones would in- 
volve inserting fragments of genomic DNA into an expression vector, such as a 
plasmid, transforming PDI-negative bacteria with the resulting genomic DNA library, 

10 and then plating the transformed bacteria onto agar containing a substrate for PDI, 
thereby allowing clones expressing the PDI to be identified. 

Alternatively, the nucleotide sequence encoding the enzyme may be prepared 
synthetically by established standard methods, e.g. the phosphoroamidite method 
described S.L Beaucage and M.H. Caruthers, (1981), Tetrahedron Letters 22, p. 

15 1859-1869, or the method described by Matthes et al., (1984), EMBO J. 3, p. 801- 
805. In the phosphoroamidite method, oligonucleotides are synthesized, e.g. in an 
automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vec- 
tors. The nucleotide sequence may be of genomic, cDNA, RNA, semisynthetic, syn- 
thetic origin, or any combinations thereof. The nucleotide sequence, e.g. a DNA se- 

20 quence, may be of mixed genomic and synthetic origin, mixed synthetic and cDNA 
origin or mixed genomic and cDNA origin, prepared by ligating fragments of syn- 
thetic, genomic or cDNA origin (as appropriate, the fragments corresponding to vari- 
ous parts of the entire DNA sequence), in accordance with standard techniques. The 
DNA sequence may also be prepared by polymerase chain reaction (PCR) using 

25 specific primers, for instance as described in US 4,683,202 or R.K. Saiki et al., 
(1988), Science 239, 1988, pp. 487-491. Alternative methods for providing polypep- 
tides of the invention include gene-shuffling method known in the art including the 
methods, e.g., described in WO 95/22625 and WO 96/00343. 

The introduction of a mutation into the nucleotide sequence to exchange one 

30 nucleotide for another nucleotide may be accomplished by site-directed mutagenesis 
using any of the methods known in the art. Particularly useful is the procedure which 
utilizes a supercoiled, double stranded DNA vector with an insert of interest and two 
synthetic primers containing the desired mutation. The oligonucleotide primers, each 
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complementary to opposite strands of the vector, extend during temperature cycling 
by means of Pfu DNA polymerase. On incorporation of the primers, a mutated plas- 
mid containing staggered nicks is generated. Following temperature cycling, the 
product is treated with Dpn\ which is specific for methylated and hemimethylated 
5 DNA to digest the parental DNA template and to select for mutation-containing syn- 
thesized DNA. Other procedures known in the art may also be used. For a general 
description of nucleotide substitution, see, e.g., Ford et a/., 1991, Protein Expression 
and Purification 2: 95-107. Reference is also made to Morinaga et al., (1984), Bio- 
technology 2, pp. 646-639. US 4,760,025 disclose the introduction of oligonucleoti- 

10 des encoding multiple mutations by performing minor alterations of the cassette. An- 
other method for introducing mutations into PDI-encoding DNA sequences is de- 
scribed in Nelson and Long, (1989), Analytical Biochemistry 180, p. 147-151. 

The present invention also relates to nucleic acid constructs comprising a nu- 
cleotide sequence of the present invention operably linked to one or more control 

15 sequences which direct the expression of the coding sequence in a suitable host cell 
under conditions compatible with the control sequences. Expression will be under- 
stood to include any step involved in the production of the polypeptide including, but 
not limited to, transcription, post-transcriptional modification, translation, post- 
radiational modification, and secretion. 

20 The term nucleic acid construct is synonymous with the term expression cas- 
sette when the nucleic acid construct contains all the control sequences required for 
expression of a coding sequence of the present invention. 

The term "coding sequence" is defined herein as a nucleotide sequence which 
directly specifies the amino acid sequence of its protein product. The boundaries of a 

25 genomic coding sequence are generally determined by a ribosome binding site (pro- 
karyotes) or by the ATG start codon (eukaryotes) located just upstream of the open 
reading frame at the 5' end of the mRNA and a transcription terminator sequence 
located just downstream of the open reading frame at the 3' end of the mRNA. A cod- 
ing sequence can include, but is not limited to, DNA, cDNA, and recombinant nucleo- 

30 tide sequences. 

The term "control sequences" is defined herein to include all components which 
are necessary or advantageous for the expression of a polypeptide of the present 
invention. Each control sequence may be native or foreign to the nucleotide se- 
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quence encoding the polypeptide. Such control sequences include, but are not lim- 
ited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal 
peptide sequence, and transcription terminator. At a minimum, the control sequences 
include a promoter, and transcriptional and translational stop signals. The control se- 

5 quences may be provided with linkers for the purpose of introducing specific restric- 
tion sites facilitating ligation of the control sequences with the coding region of the 
nucleotide sequence encoding a polypeptide. The term "operably linked" is defined 
herein as a configuration in which a control sequence is appropriately placed at a 
position relative to the coding sequence of the DNA sequence such that the control 

10 sequence directs the expression of a polypeptide. 

The control sequence may be an appropriate promoter sequence, a nucleotide 

♦ 

sequence which is recognized by a host cell for expression of the nucleotide se- 
quence. The promoter sequence contains transcriptional control sequences which 
mediate the expression of the polypeptide. The promoter may be any nucleotide se- 
15 quence which shows transcriptional activity in the host cell of choice including mu- 
tant, truncated, and hybrid promoters, and may be obtained from genes encoding 
extracellular or intracellular polypeptides either homologous or heterologous to the 
host cell. 

Examples of suitable promoters for directing the transcription of the nucleic acid 
20 constructs of the present invention, especially in a bacterial host cell, are the promot- 
ers obtained from the £. coli lac operon, Streptomyces coelicolor agarase gene 
(dagA), Bacillus subtilis levansucrase gene (sacB) t Bacillus Hcheniformis alpha- 
amylase gene {amyL), Bacillus stearothermophilus maltogenic amylase gene 
(amyM), Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheni- 
25 formis penicillinase gene (penP), Bacillus subtilis xylA and xylB genes, and prokary- 
otic beta-lactamase gene (Villa-Kamaroff et a/., 1978, Proceedings of the National 
Academy of Sciences USA 75: 3727-3731), as well as the tac promoter (DeBoer et 
aA, 1983, Proceedings of the National Academy of Sciences USA 80: 21-25). Further 
promoters are described in "Useful proteins from recombinant bacteria" in Scientific 
30 American, 1980, 242: 74-94; and in Sambrook et a/., 1989, supra. 

Examples of suitable promoters for directing the transcription of the nucleic acid 
constructs of the present invention in a filamentous fungal host cell are promoters 
obtained from the genes for Aspergillus oryzae TAKA amylase, Rhizomucor miehei 



♦ 
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aspartic proteinase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid 
stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase {glaA) t 
Rhizomucor miehei lipase, Aspergillus oryzae alkaline protease, Aspergillus oryzae 
triose phosphate isomerase, Aspergillus nidulans acetamidase, and Fusarium ox- 
5 ysporum trypsin-like protease (WO 96/00787), as well as the NA2-tpi promoter (a 
hybrid of the promoters from the genes for Aspergillus niger neutral alpha-amylase 
and Aspergillus oryzae triose phosphate isomerase), and mutant, truncated, and hy- 
brid promoters thereof. 

In a yeast host, useful promoters are obtained from the genes for Saccharomy- 

10 ces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), 
Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate de- 
hydrogenase (ADH2/GAP), and Saccharomyces cerevisiae 3-phosphoglycerate 
kinase. Other useful promoters for yeast host cells are described by Romanos et a/., 
1992, Yeast 8: 423-488. 

15 The control sequence may also be a suitable transcription terminator sequence, 
a sequence recognized by a host cell to terminate transcription. The terminator se- 
quence is operably linked to the 3' terminus of the nucleotide sequence encoding the 
polypeptide. Any terminator that is functional in the host cell of choice may be used in 
the present invention. 

20 The control sequence may also be a suitable leader sequence, a nontranslated 
region of an mRNA which is important for translation by the host cell. The leader se- 
quence is operably linked to the 5' terminus of the nucleotide sequence encoding the 
polypeptide. Any leader sequence that is functional in the host cell of choice may be 
used in the present invention. 

25 The control sequence may also be a polyadenylation sequence, a sequence 

operably linked to the 3' terminus of the nucleotide sequence and which, when tran- 
scribed, is recognized by the host cell as a signal to add polyadenosine residues to 
transcribed mRNA. Any polyadenylation sequence which is functional in the host cell 
of choice may be used in the present invention. 

30 To allow the secretion of the expressed protein variant, i.e. the extracellular ex- 
pression, the nucleic acid construct may include a signal sequence, inserted prior to 
the coding sequence. Thus, the control sequence may also be a signal peptide cod- 
ing region that codes for an amino acid sequence linked to the amino terminus of a 
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polypeptide and directs the encoded polypeptide into the cell's secretory pathway. 
The 5' end of the coding sequence of the nucleotide sequence may inherently con- 
tain a signal peptide coding region naturally linked in translation reading frame with 
the segment of the coding region which encodes the secreted polypeptide. Alterna- 

5 tively, the 5' end of the coding sequence may contain a signal peptide coding region 
which is foreign to the coding sequence, The foreign signal peptide coding region 
may be required where the coding sequence does not naturally contain a signal pep- 
tide coding region. Alternatively, the foreign signal peptide coding region may simply 
replace the natural signal peptide coding region in order to enhance secretion of the 

10 polypeptide. However, any signal peptide coding region which directs the expressed 
polypeptide into the secretory pathway of a host cell of choice may be used in the 
present invention. 

Effective signal peptide coding regions for filamentous fungal host cells are the 
signal peptide coding regions obtained from the genes for Aspergillus oryzae TAKA 
15 amylase, Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Rhi- 
zomucor miehei aspartic proteinase, Humicola insolens cellulase, and Humicola la- 
nuginosa lipase. 

The constructs further contain one or more exons of the endogenous gene. An 
exon is defined as a DNA sequence which is copied into RNA and is present in a ma- 

20 ture mRNA molecule such that the exon sequence is in-frame with the coding region 
of the endogenous gene. The exons can, optionally, contain DNA which encodes one 
or more amino acids and/or partially encodes an amino acid. Alternatively, the exon 
contains DNA which corresponds to a 5' non-encoding region. Where the exogenous 
exon or exons encode one or more amino acids and/or a portion of an amino acid, 

25 the nucleic acid construct is designed such that, upon transcription and splicing, the 
reading frame is in-frame with the coding region of the endogenous gene so that the 
appropriate reading frame of the portion of the mRNA derived from the second exon 
is unchanged. 

The splice-donor site of the constructs directs the splicing of one exon to an- 
30 other exon. Typically, the first exon lies 5' of the second exon, and the splice-donor 
site overlapping and flanking the first exon on its 3' side recognizes a splice-acceptor 
site flanking the second exon on the 5' side of the second exon. A splice-acceptor 
site, like a splice-donor site, is a sequence which directs the splicing of one exon to 
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another exon. Acting in conjunction with a splice-donor site, the splicing apparatus 
uses a splice-acceptor site to effect the removal of an intron. 

Expression of the polypeptides 

5 The present invention also relates to a vector comprising a nucleotide sequence 

of the invention, including recombinant expression vectors comprising the nucleotide 
sequence, a promoter, and transcriptional and translational stop signals. The various 
nucleic acid and control sequences described above may be joined together to pro- 
duce a recombinant expression vector which may include one or more convenient 

10 restriction sites to allow for insertion or substitution of the nucleotide sequence en- 
coding the polypeptide at such sites. Alternatively, the nucleotide sequence of the 
present invention may be expressed by inserting the nucleotide sequence or a nu- 
cleic acid construct comprising the sequence into an appropriate vector for expres- 
sion. In creating the expression vector, the coding sequence is located in the vector 

15 so that the coding sequence is operably linked with the appropriate control se- 
quences for expression. 

The recombinant expression vector may be any vector (e.g., a plasmid or virus) 
which can be conveniently subjected to recombinant DNA procedures and can bring 
about the expression of the nucleotide sequence. The choice of the vector will typi- 

20 cally depend on the compatibility of the vector with the host cell into which the vector 
is to be introduced. The vectors may be linear or closed circular plasmids. 

The vector may be an autonomously replicating vector, /.e., a vector which ex- 
ists as an extrachromosomal entity, the replication of which is independent of chro- 
mosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromo- 

25 some, or an artificial chromosome. The vector may contain any means for assuring 
self-replication. Alternatively, the vector may be one which, when introduced into the 
host cell, is integrated into the genome and replicated together with the chromo- 
some^) into which it has been integrated. Furthermore, a single vector or plasmid or 
two or more vectors or plasmids which together contain the total DNA to be intro- 

30 duced into the genome of the host cell, or a transposon may be used. 

The vectors of the present invention preferably contain one or more selectable 
markers which permit easy selection of transformed cells. A selectable marker is a 
gene the product of which provides for biocide or viral resistance, resistance to heavy 
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metals, prototrophy to auxotrophs, and the like. Examples of bacterial selectable 
markers are the dal genes from Bacillus subtilis or Bacillus Hcheniformis, or markers 
which confer antibiotic resistance such as ampicillin, kanamycin, chloramphenicol or 
tetracycline resistance. The vector may comprise Aspergillus selection markers such 
5 as amdS, argB, niaD and sC, a marker giving rise to hygromycin resistance, or the 
selection may be accomplished by co-transformation, e.g. as described in WO 
91/17243. 

For autonomous replication, the vector may further comprise an origin of repli- 
cation enabling the vector to replicate autonomously in the host cell in question. 

10 More than one copy of a nucleotide sequence of the present invention may be 

inserted into the host cell to increase production of the gene product. An increase in 
the copy number of the nucleotide sequence can be obtained by integrating at least 
one additional copy of the sequence into the host cell genome or by including an am- 
plifiable selectable marker gene with the nucleotide sequence where cells containing 

15 amplified copies of the selectable marker gene, and thereby additional copies of the 
nucleotide sequence, can be selected for by cultivating the cells in the presence of 
the appropriate selectable agent. 

The procedures used to ligate the elements described above to construct the 
recombinant expression vectors of the present invention are well known to one skilled 

20 in the art (see, e.g., Sambrook et a/., 1989, supra). 

Host cells 

The present invention also relates to recombinant host cells, comprising a nu- 
cleotide sequence of the invention, which are advantageously used in the recombi- 

25 nant production of the polypeptides. A vector comprising a nucleotide sequence of 
the present invention is introduced into a host cell so that the vector may be main- 
tained as a chromosomal integrant or as a self-replicating extra-chromosomal vector 
as described earlier. The term "host ceir encompasses any progeny of a parent cell 
that is not identical to the parent cell due to mutations that occur during replication. 

30 The host cell may be a unicellular microorganism, e.g., a prokaryote, or a non- 
unicellular microorganism, e.g., a eukaryote. The host cell may be chosen from 
mammal, avian, insect or plant cells, or it may be selected from bacteria or fungi. 
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Useful unicellular cells are bacterial cells such as gram positive bacteria includ- 
ing, but not limited to, a Bacillus cell, e.g., Bacillus alkalophilus, Bacillus amylolique- 
faciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacil- 
lus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus 

5 stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis\ or a Streptomyces 
cell, e.g., Streptomyces lividans and Streptomyces murinus, or gram negative bacte- 
ria such as E. coli and Pseudomonas sp. In a preferred embodiment, the bacterial 
host cell is a Bacillus lentus, Bacillus licheniformis, Bacillus stearothermophilus, or 
Bacillus subftlis cell. In another preferred embodiment, the Bacillus cell is an alkalo- 

10 philic Bacillus. 

The introduction of a vector into a bacterial host cell may, for instance, be ef- 
fected by protoplast transformation (see, e.g., Chang and Cohen, 1979, Molecular 
General Genetics 168: 111-115), using competent cells (see, e.g., Young and 
Spizizin, 1961, Journal of Bacteriology 81: 823-829, or Dubnau and Davidoff- 
15 Abelson, 1971 , Journal of Molecular Biology 56: 209-221), electroporation (see, e.g., 
Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, e.g., 
Koehlerand Thorne, 1987, Journal of Bacteriology 169: 5771-5278). 

In a preferred embodiment, the host cell is a fungal cell. "Fungi" as used herein 
includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota 
20 (as defined by Hawksworth et a/., In, Ainsworth and Bisby's Dictionary of The Fungi, 
8th edition, 1995, CAB International, University Press, Cambridge, UK) as well as the 
Oomycota (as cited in Hawksworth et a/., 1995, supra, page 171) and all mitosporic 
fungi (Hawksworth et a/., 1995, supra). 

In a more preferred embodiment, the fungal host cell is a yeast cell. "Yeast" as 

» 

25 used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous 
yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classi- 
fication of yeast may change in the future, for the purposes of this invention, yeast 
shall be defined as described in Biology and Activities of Yeast (Skinner, F.A., Pass- 
more, S.M., and Davenport, R.R., eds, Soc. App. Bacterid Symposium Series No. 9, 

30 1 980). 

In an even more preferred embodiment, the yeast host cell is a Candida, Han- 
senula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia 
cell. 
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In a most preferred embodiment, the yeast host cell is a Saccharomyces carls- 
bergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces 
douglasii, Saccharomyces kluyveri, Saccharomyces norbensis or Saccharomyces 
oviformis cell. In another most preferred embodiment, the yeast host cell is a Kluy- 

5 veromyces lactis cell. In another most preferred embodiment, the yeast host cell is a 
Yarrowia lipolytica cell. 

In another more preferred embodiment, the fungal host cell is a filamentous 
fungal cell. "Filamentous fungi" include all filamentous forms of the subdivision 
Eumycota and Oomycota (as defined by Hawksworth et a/., 1995, supra). The fila- 

10 mentous fungi are generally characterized by a mycelial wall composed of chitin, cel- 
lulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative 
growth is by hyphal elongation and carbon catabolism is obligately aerobic. In con- 
trast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding 
of a unicellular thallus and carbon catabolism may be fermentative. 

15 In an even more preferred embodiment, the filamentous fungal host cell is a cell 
of a species of, but not limited to, Acremonium, Aspergillus, Fusarium, Humicola, 
Mucor, Myceliophthora, Neurospora, Penicillium, Thielavia, Tolypocladium, or 
Trichoderma. 

In a most preferred embodiment, the filamentous fungal host cell is an Aspergil- 
20 lus awamori, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Asper- 
gillus niger or Aspergillus oryzae cell. In another most preferred embodiment, the fil- 
amentous fungal host cell is a Fusarium bactridioides, Fusarium cerealis, Fusarium 
crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, 
Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticula- 
25 turn, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium 
sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothe- 
cioides, or Fusarium venenatum cell. In an even most preferred embodiment, the fil- 
amentous fungal parent cell is a Fusarium venenatum (Nirenberg sp. nov.) cell. In 
another most preferred embodiment, the filamentous fungal host cell is a Humicola 
30 insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neuro- 
spora crassa, Penicillium purpurogenum, Thielavia terrestris, Trichoderma har- 
zianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or 
Trichoderma viride cell. 
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Fungal cells may be transformed by a process involving protoplast formation, 
transformation of the protoplasts, and regeneration of the cell wall in a manner known 
perse. Suitable procedures for transformation of Aspergillus host cells are described 
in EP 238 023 and Yelton et a/., 1984, Proceedings of the National Academy ofSci- 

5 ences USA 81: 1470-1474. Suitable methods for transforming Fusarium species are 
described by Malardier et a/., 1989, Gene 78: 147-156, and WO 96/00787. Yeast 
may be transformed using the procedures described by Becker and Guarente, In 
Abelson, J.N. and Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biol- 
ogy, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New 

10 York; Ito et a/., 1983, Journal of Bacteriology 153: 163; and Hinnen et a/., 1978, Pro- 
ceedings of the National Academy of Sciences USA 75: 1920. 



Methods of producing the polypeptides of the invention 

In the production methods of the present invention, the cells are cultivated in a 
15 nutrient medium suitable for production of the polypeptide using methods known in 
the art. For example, the cell may be cultivated by shake flask cultivation, and small- 
scale or large-scale fermentation (including continuous, batch, fed-batch, or solid 
state fermentations) in laboratory or industrial fermentors performed in a suitable me- 
dium and under conditions allowing the polypeptide to be expressed and/or isolated. 
20 The cultivation takes place in a suitable nutrient medium comprising carbon and ni- 
trogen sources and inorganic salts, using procedures known in the art. Suitable me- 
dia are available from commercial suppliers or may be prepared according to pub- 
lished compositions (e.g., in catalogues of the American Type Culture Collection). If 
the polypeptide is secreted into the nutrient medium, the polypeptide can be recov- 
25 ered directly from the medium. If the polypeptide is not secreted, it can be recovered 
from cell lysates. 

The resulting polypeptide may be recovered by methods known in the art. For 
example, the polypeptide may be recovered from the nutrient medium by conven- 
tional procedures including, but not limited to, centrifugation, filtration, extraction, 
30 spray-drying, evaporation, or precipitation. Polypeptide may be recovered from the 
medium by conventional procedures including separating the cells from the medium 
by centrifugation or filtration. If necessary a purification step may be carried out, for 
example ion exchange chromatography, affinity chromatography or the like. Thus, the 
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polypeptides of the present invention may be purified by a variety of procedures 
known in the art including, but not limited to, chromatography (e.g., ion exchange, 
affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic proce- 
dures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium 

5 sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, J.C. 
Janson and Lars Ryden, editors, VCH Publishers, New York, 1989). 

The present invention also relates to a transgenic plant, plant part, or plant cell 
which has been transformed with a nucleotide sequence encoding a polypeptide of 
the invention so as to express and produce the polypeptide in recoverable quantities. 

10 The polypeptide may be recovered from the plant or plant part. Alternatively, the plant 
or plant part containing the recombinant polypeptide may be used as such for improv- 
ing the quality of food or feed, e.g., improving •nutritional value, palatability, and 
rheological properties, or to destroy an antinutritive factor. 

The transgenic plant can be dicotyledonous (a dicot) or monocotyledonous (a 

15 monocot). Examples of monocot plants are grasses, such as meadow grass (blue 
grass, Poa), forage grass such as festuca, lolium, temperate grass, such as Agrostis, 
and cereals, e.g., wheat, oats, rye, barley, rice, sorghum, and maize (corn). Exam- 
ples of dicot plants are tobacco, legumes, such as lupins, potato, sugar beet, pea, 
bean and soybean, and cruciferous plants (family Brassicaceae), such as cauliflower, 

20 rape seed, and the closely related model organism Arabidopsis thaliana. 

Examples of plant parts are stem, callus, leaves, root, fruits, seeds, and tubers. 
Also specific plant tissues, such as chloroplast, apoplast, mitochondria, vacuole, per- 
oxisomes, and cytoplasm are considered to be a plant part. Furthermore, any plant 
cell, whatever the tissue origin, is considered to be a plant part. Also included within 

25 the scope of the present invention are the progeny of such plants, plant parts and 
plant cells. 

The transgenic plant or plant cell expressing a polypeptide of the present inven- 
tion may be constructed in accordance with methods known in the art. Briefly, the 
plant or plant cell is constructed by incorporating one or more expression constructs 
30 encoding a polypeptide of the present invention into the plant host genome and 
propagating the resulting modified plant or plant cell into a transgenic plant or plant 
cell. 



» 
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Conveniently, the expression construct is a nucleic acid construct which com- 
prises a nucleotide sequence encoding a polypeptide of the present invention opera- 
bly linked with appropriate regulatory sequences required for expression of the nu- 
cleotide sequence in the plant or plant part of choice. Furthermore, the expression 
5 construct may comprise a selectable marker useful for identifying host cells into 
which the expression construct has been integrated and DNA sequences necessary 
for introduction of the construct into the plant in question (the latter depends on the 
DNA introduction method to be used). 

The choice of regulatory sequences, such as promoter and terminator se- 

10 quences and optionally signal or transit sequences is determined, for example, on 
the basis of when, where, and how the polypeptide is desired to be expressed. For 
instance, the expression of the gene encoding a polypeptide of the present invention 
may be constitutive or inducible, or may be developmental, stage or tissue specific, 
and the gene product may be targeted to a specific tissue or plant part such as seeds 

15 or leaves. Regulatory sequences are, for example, described by Tague et a/., 1988, 
Plant Physiology 86: 506. 

For constitutive expression, the 35S-CaMV promoter may be used (Franck et 
a/., 1980, Cell 21: 285-294). Organ-specific promoters may be, for example, a pro- 
moter from storage sink tissues such as seeds, potato tubers, and fruits (Edwards & 

•20 Coruzzi, 1990, Ann. Rev. Genet 24: 275-303), or from metabolic sink tissues such as 
meristems (Ito et a/., 1994, Plant Mol. Biol. 24: 863-878), a seed specific promoter 
such as the glutelin, prolamin, globulin, or albumin promoter from rice (Wu et a/., 
1998, Plant and Cell Physiology 39: 885-889), a Vicia faba promoter from the legu- 
min B4 and the unknown seed protein gene from Vicia faba (Conrad et a/., 1998, 

25 Journal of Plant Physiology 152: 708-71 1), a promoter from a seed oil body protein 
(Chen et a/., 1998, Plant and Cell Physiology 39: 935-941), the storage protein napA 
promoter from Brassica napus, or any other seed specific promoter known in the art, 
e.g., as described in WO 91/14772. Furthermore, the promoter may be a leaf specific 
promoter such as the rbcs promoter from rice or tomato (Kyozuka et a/., 1993, Plant 

30 Physiology 102: 991-1000, the chlorella virus adenine methyltransferase gene pro- 
moter (Mitra and Higgins, 1 994, Plant Molecular Biology 26: 85-93), or the aldP gene 
promoter from rice (Kagaya et a/., 1995, Molecular and General Genetics 248: 668- 



WO 00/70064 



PCT/DK00/00265 



27 

674), or a wound inducible promoter such as the potato pin2 promoter (Xu et a/., 
1993, Plant Molecular Biology 22: 573-588). 

A promoter enhancer element may also be used to achieve higher expression 
of the enzyme in the plant. For instance, the promoter enhancer element may be an 
5 intron which is placed between the promoter and the nucleotide sequence encoding 
a polypeptide of the present invention. For instance, Xu et a/., 1993, supra disclose 
the use of the first intron of the rice actin 1 gene to enhance expression. 

The selectable marker gene and any other parts of the expression construct 
may be chosen from those available in the art. 

10 The nucleic acid construct is incorporated into the plant genome according to 

conventional techniques known in the art, including Agrobacterium-mediaXed trans- 
formation, virus-mediated transformation, microinjection, particle bombardment, bio- 
listic transformation, and electroporation (Gasser et a/., 1990, Science 244: 1293; 
Potrykus, 1990, Bio/Technology 8: 535; Shimamoto ef a/., 1989, Nature 338: 274). 

15 Presently, Agrobacterium tumefaciens-rnediaXed gene transfer is the method of 

choice for generating transgenic dicots (for a review, see Hooykas and Schilperoort, 
1992, Plant Molecular Biology 19: 15-38). However, it can also be used for transform- 
ing monocots, although other transformation methods are generally preferred for 
these plants. Presently, the method of choice for generating transgenic monocots is 

20 particle bombardment (microscopic gold or tungsten particles coated with the trans- 
forming DNA) of embryonic calli or developing embryos (Christou, 1992, Plant Jour- 
nal 2: 275-281; Shimamoto, 1994, Current Opinion Biotechnology 5: 158-162; Vasil 
et a/., 1992, Bio/Technology 10: 667-674). An alternative method for transformation 
of monocots is based on protoplast transformation as described by Omirulleh et a/., 

25 1 993, Plant Molecular Biology 2 1 : 4 1 5-428. 

Following transformation, the transformants having incorporated therein the ex- 
pression construct are selected and regenerated into whole plants according to 
methods well-known in the art. 

The present invention also relates to methods for producing a polypeptide of the 

30 present invention comprising (a) cultivating a transgenic plant or a plant cell 
comprising a nucleotide sequence encoding a polypeptide capable of reducing 
disulfide bonds of the present invention under conditions conducive for production of 
the polypeptide; and (b) recovering the polypeptide. 



WO 00/70064 



PCT/DKOO/00265 



28 

Properties of the polypeptides of the invention 

The polypeptides of the invention preferably have an improved capability of 
breaking protein disulfide bonds. Of particular interest is a polypeptide, capable of 

5 being produced in large quantities and having a redox potential which is lower than 
that of SEQ ID NO: 13 (e.g. without the signal peptide) or which is lower than that of a 
subsequence of SEQ ID NO: 13. In other embodiment, the polypeptide of the inven- 
tion has a redox potential which is lower than that of the amino acid sequence of 
amino acid number 1-281 of SEQ ID NO: 13 or which is lower than that of the amino 

10 acid sequence of amino acid number 21-281 of SEQ ID NO:13. 

In one embodiment the polypeptide of the invention has a redox potential which 
is decreased with at least 10mV, at least 20mV, at least 30mV or at least 50mV com- 
pared to the protein disulfide isomerase having the amino acid sequence SEQ ID 
NO:13 (e.g. without the signal peptide) or a subsequence of SEQ ID NO:13, such as 

15 amino acid number 21-281 of SEQ ID NO: 13. In further embodiments the polypeptide 
of the invention has a redox potential of less than -200mV, such as less than - 
220mV, less than -240mV, or less than -250mV. In one embodiment wherein the 
polypeptide is a variant of a parent PDI, the variant polypeptide has a redox potential 
which is lower than that of the parent disulfide isomerase, such as e.g. at least, 10%, 

20 or at least 20% lower. The lower redox potential means that the polypeptide has 
higher reducing properties than that of the parent disulfide isomerase. The polypep- 
tide of the invention may have a redox potential which is e.g. at least 30 mV or at 
least 50mV lower as compared to the redox potential of the parent protein disulfide 
isomerase, such as, e.g. a redox potential which is in the range of 30-80 mV or 30- 

25 50mV lower as compared to the redox potential of the parent protein disulfide isom- 
erase . 

The redox potential may be determined as described in J. Lundstrom, et al 
(1992) J. Biol. Chem. 267, 9047 - 9052. 

The redox potential of an enzymes capable of reducing disulfide bonds is a 
30 measure of whether the reaction has equilibrium to the right (higher redox potential) 
or to the left (lower redox potential) in the following reaction: 
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Cys-Xxx-Yyy-Cys = Cys-Xxx-Yyy-Cys + 2e" + 2H + . 

\ / \ / 

SH HS S-S 

5 The following table disclose standard redox potentials for the reaction. 



Molecule 


Source 


Sequence 


Redox potential 


Reference 


PDI 


Bovine 


CGHC 


-1 90 mV 


d) 


Thioredoxin 
(Trx) 


E. coli 


CGPC 


-270 mV 


(2) 


P34H Trx 


E. coli 


CGHC 


-235 mV 


(2) 


Thioredoxin 
reductase 


E. coli 


CATC 


-250 mV 


(3) 
(4) 


DsbA 


E. coli 


CPHC 


-89 mV 


(5) 


CaBP1 


Rat 


CGHC 


(about -200 mV) 


(2) 



(1) Lundstrbm J. and A. Holmgren (1993) Biochemistry 32, 6649 -6655. 

(2) Lundstrdm-Ljung, J. et. al (1995) FEBS Letters 357, 305 - 308. 
10 (3) Russel M and P Model (1988): J. Biol. Chem. 263, 9015-9019. 

(4) Siedler, F. et al (1993) Biochemistry 32, 7488-7495. 
' (5) Wunderlich M. and R. Glockshuber (1993): Protein Science 2, 717 -726 

Use of the polypeptide of the invention 

15 The present invention may be applied to a number of industrial fields. Allergy 

towards certain food items is an increasing concern to many people. Accordingly, the 
polypeptides of the present invention may be applied for the use as an allergen re- 
ducing agent. 

In particular, the polypeptides may be used as an allergen reducing agent in 
20 food, such as gluten or milk. By applying the polypeptides of the present invention to 
various food or feed products before, during, or after fabrication, disulfide bonds will 
be reduced and thus an allergic reaction in susceptible individuals avoided. 

Another area of particular interest in relation to the present invention is the in- 
creasing allergy in people, i.e. infants, toward milk supplements. A common practice 
25 of destabilising allergy promoting proteins in milk is that of heat treatment. However, 
heat treatment may reduce only some of the allergens, and has the unfortunate side 
effect of at the same time reducing the nutritional value of the milk. By using the 
polypeptides disclosed by the present invention it is possible to increase the suscep- 
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tibility of milk proteins, such as ct-lactalbumin, to the degradation by the enzyme tryp- 
sin without having to heat treat the milk under high temperature conditions. The poly- 
peptides may be used for reducing allergens in food or feed, e.g. gluten or milk 
based products, including beverages, such as infant formula and dietary drinks. 

5 It therefore follows that the present invention presents an advantage of reducing 

allergens in milk and at the same time preserve its nutritional value, in addition to 
having the benefit of being manufactured on a large industrial scale. 

In another aspect of the present invention a preferred embodiment is applying 
the polypeptides of the invention to the baking industry for the reduction of allergens 

10 in gluten. Like milk allergies many people suffer from gluten intolerance and this fact 
signifies yet another vast industrial application of the invention. 

Further aspects is use of the polypeptides of the invention for increasing the 
digestibility of food or feed, such as, e.g. for increasing the digestibility of milk or 
wheat based food or feed products. The polypeptides may also be used in the manu- 

15 facturing of a cosmetic product and contemplated are also cosmetic products com- 
prising a polypeptide of the invention. 

Yet another aspect of the present invention is the use of the polypeptides for the 
treatment or degradation of selenoproteins, the treatment and cleaning of fabrics, ad- 
ditives to detergents and pharmaceutical preparations for the treatment of eye suffer- 

20 ings. 

According to the invention it is further envisaged that the gene of the polypep- 
tides of the present invention may be expressed in plants or animals for the pre- 
treatment of allergens before processing the plants or animals into products. 

The present invention also relates to compositions comprising the polypeptide 
25 according to the invention. The content of the polypeptide per gram of composition 
depends on the use of the composition. However, the compositions may suitably 
comprise 0,01 - 1,00 mg of polypeptide per g, preferably 0,05 - 0,1 mg of polypep- 
tide per gram. 

Further to the invention the composition comprising the polypeptide of the in- 
30 vention may have any suitable form, such as the form of a granulate, a stabilised liq- 
uid or a protected enzyme. The composition may also comprise a suitable redox 
partner, such as an organic or inorganic reductant. 
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For many of the applications mentioned above it may be convenient that the 
composition may comprise at least one other enzyme than the polypeptide of the in- 
vention, such as a protease, an amylase, a lipase, a hydrolase, a peroxidase, a cellu- 
lase, a transglutaminase, a glucose oxidase, a xylanase, a pectin methyl esterase or 
5 a pectin lyase. In particular, in connection with reducing allergens in milk the compo- 
sition may comprise other enzymes, such as proteases and hydrolases. 

The invention also relates to food additives or cosmetic products comprising a 
polypeptide of the invention. 

10 The present invention is further described by the following examples which 

should not be construed as limiting the scope of the invention. 

EXAMPLES 

15 

The following are examples of the construction of a reducing protein variant trunca- 
tion. 

EXAMPLE 1: Construction of the reducing variant H61P PDI truncation 

20 

a) Construction of the expression plasmid pCaHj 527 : 

The Aspergillus oryzae expression plasmid pCaHj 483 (cf. WO 98/00529) con- 
sists of an expression cassette based on the Aspergillus niger neutral amylase II 
promoter fused to the Aspergillus nidulans those phosphate isomerase non trans- 

25 lated leader sequence (Pna2/tpi) and the Aspergillus niger amyloglycosidase termi- 
nater (Tamg). Also present on the plasmid is the Aspergillus selective marker amdS 
from Aspergillus nidulans enabling growth on acetamide as sole nitrogen source. 
These elements are cloned into the £. coli vector pUC19. The ampecillin resistance 
marker enabling selection in E. coli of this plasmid was replaced with the URA3 

30 marker of Saccharomyces cerevisiae that can complement a pyrF mutation in E. coli 
in the following way: 

The pUC 19 origin of replication was PCR amplified from pCaHj483 with the 
primers: 

142779: TTG AAT TGA AAA TAG ATT GAT TTA AAA CTT C (SEQ ID NO: 1 9) 
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142780: TTG CAT GCG TAA TCA TGG TCA TAG C (SEQ ID NO:20) 

The primer 142780 introduces a Bbu I site in the PGR fragment. 

The Expand PCR system (Roche Molecular Biochemicals, Basel, Schweitzer- 
land) was used for the amplification following the manufacturers instructions for this 

5 and the subsequent PCR amplifications. 

The URA3 gene was amplified from the general S. cerevisiae cloning vector 
pYES2 (Invitrogen corporation, Carlsbad, Ca, USA) using the primers: 
140288: TTG AAT TCA TGG GTA ATA ACT GAT AT (SEQ ID NO:21 ) 

142778: AAA TCA ATC TAT TTT CAA TTC AAT TCA TCA TT (SEQ ID NO:22) 

10 The primer 140288 introduces an EcoR I site in the PCR fragment. 

The two PCR fragments were fused by mixing them and amplifying using the 
primers 142780 and 140288 using the splicing by overlap method (Horton et. al, 
(1989), Gene, 77, 61-68). The resulting segment was digested by EcoR I and Bbu I 
and ligated to the largest fragment of pCaHj 483 being digested by the same en- 

15 zymes. The ligation mixture was used to transform the pyrF E. coli strain DB6507 
(ATCC 35673) made competent by the method of Mandel and Higa (Mandel, M. and 
Higa, A. (1970), J. MoL Biol. 45, 154). Transformants were selected on solid M9 me- 
dium (Sambrook et. al (1989) Molecular cloning, a laboratory manual, 2. edition, Cold 
Spring Harbor Laboratory Press) supplemented with 1 g/l casamino acids, 500 jag/l 

20 thiamine and 10 mg/l kanamycin. 

A plasmid from such a transformant was called pCaHj 527 and is outlined in 
Figure 1 . 

b) Construction of a truncated PDI : 
25 This section "b)" is also disclosed in WO 95/00636 (Novo Nordisk A/S). 

Cloning of Aspergillus oryzae and Aspergillus niger PDI encoding genes 

1.1. Design of oligo nucleotides for PCR amplification 

PDI from different organisms are highly homologous especially near the active site 
30 residues. In Fig.7, the following 7 PDI gene products were aligned: 

Bovine (Bos taurus) PDI (Yamauchi et al., Biochem. Biophys. Res. Commun. 146:1485- 

1492. 1987) , Chicken (Gallus gallus) PDI (Parkkonen et aL, Biochem. J. 256:1005- 

1011. 1988) , Human (Homo sapiens) PDI (Rapilajaniemi et al. EMBO J. 6:643-649, 
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1987), Mouse (Mus musculus) PDI (Gong, et al., Nucleic Acids Res. 16:1203, 1988), 
Rabbit (Oryctolagus cuniculus) PDI (Fliege! et al., J. Biol. Chem. 265:15496-15502, 
1990), Rat (Rattus norvegicus) PDI (Edman et al., Nature 317:267-270, 1985), Yeast 
(Saccharomyces cerevisiae) PDI (Tachikawa et al., J. Biochem. 1 10:306-313). 
5 Each subunit contains two active centres (Freedman et al., Cell 57:1069-1072, 

1989) and the homology in the surroundings of these active centres are particularly 
strong. A consensus amino acid sequence for the active centre closest to the N- 
terminus was determined from the alignment as -APWCGHCK- and an oligo 
deoxyribonucleotide encoding the peptide -WCGHCK- and extended with an EcoRI site 
10 in the 5* end, was synthesized: 

5TGGAATTCTGGTGYGGNCAYTGYAA3* (primer 4762, 25 nucleotides, 32 species) 

(Y=C or T; R=A or G; N=A, T, C, or G). 

A consensus amino acid sequence for the active centre closest to the C-terminus 
was determined: -YAPWCGHCK- and an oligo deoxyribonucleotide encoding the 
15 peptide -YAPWCG- in antisense and extended with a BamHI site in the 5' end was 
synthesized: 

5TGGGATCCRCACCANGGNGCRTA3* (primer 4763, 23 nucleotides, 64 species). 

These oligo deoxyribonucleotides (primers 4762 and 4763) were used as primers in a 
20 PCR reaction to amplify PDI-encoding gene fragments from genomic DNA from A. 
oryzae and A. niger 

1.2 Amplification and cloning of fragments of PDI-encoding genes. 

Genomic DNA was prepared from Aspergillus oryzae IFO 4177 and Aspergillus 
25 niger A524 as described by Yelton et al. (Proc. Natl. Acad. Sci. USA 81:1470-1474, 
1984). 

PCR reaction mixtures contained Taq DNA polymerase buffer supplied by 
Clontech laboratories Inc. and diluted as described, 250 of each of dATP, dCTP, 
dGTP, and, dTTP, 100 pmol of each of primers 4762 and 4763, and 0.5 pg of genomic 
30 DNA of either A. niger or A. oryzae. The total reaction volume was 0.1 ml, and it was 
covered with 0.05 ml paraffin oil. 

The following program was run on a Cetus Perkin Elmer thermal cycler: 
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1. cycle: 94°C for 2 min., (when the temperature reached 94°C 2.5 U of Taq DNA 
polymerase supplied by Clontech laboratories Inc. was added). 
10 cycles: 94°C for 1 min., 50°C for 1 min., and 72°C for 2 min. 
30 cycles: 94°C for 1 min., 55°C for 1 min., and 72°C for 2 min. 
5 1 cycle: 72°C for 5 min. 

The reaction mixtures were loaded on an agarose gel, and both the A. oryzae and 
the A. niger DNA produced fragments of approximately 1.1 kb. 

The fragments were digested with EcoRI and BamHI and ligated to pUC19 
10 (Yanisch-Perron et al. t Gene 33:103-119, 1985). The ligation mixture was transformed 
into £ coli DH5aF (Woodcock et al., Nucleic Acids Res. 17:3469-3478). Recombinant 
plasmids were subjected to sequence analysis using the Sequenase™ kit (United 
States Biochemical) and a M13 universal primer following the manufacturers 
instructions. The analysis confirmed that both in the case of A. oryzae and in that of A. 
15 niger sequences homologous to other PDI genes were amplified and cloned. 

1.3 Genome cloning of the A. oryzae PDI-encoding gene. 

Genomic DNA from A. oryzae was digested with the following restriction enzymes 
supplied by New England Biolabs Inc.: Hindlll, BamHI, BamHI+Hindlll, EcoRI, 

20 EcoRI+Hindlll, Sail, Sall+Hindlll, Bglll, Bglll+Hindlll, Pstl and Pstl+Hindlll. After 
digestion, the reaction mixtures were run on a 1% agarose gel and then blotted onto an 
Immobilon N™ membrane (Millipore Corporation) following the manufacturers 
instructions. The membrane was probed with the cloned A. oryzae PCR product 
isolated as a BamHI-EcoRI fragment and radio labelled with 32 P. After stringent washes 

25 the membrane was subjected to autoradiography. 

Genomic DNA from A. niger was digested with the following restriction enzymes: Bglll, 
BamHI, BamHI+Bglll, EcoRI, EcoRI+Bglll, Sail, Sall+Bglll, Hindlll, Hindlll+Bglll, Pstl 
and Pstl+Bglll. The Southern blot was made as described with A. oryzae, only the A. 
niger PCR product was used as probe. 

30 

1 .4 Construction of genomic A. oryzae library. 

Southern analysis indicated that the A. oryzae PDI gene was located on a 2.3 kb 
BamH I, Hind HI fragment. Genomic A. oryzae DNA was digested with BamH I and Hind 
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III and fragments ranging from 1.9 - 3 kb were isolated from an agarose gel. This 
mixture of fragments was ligated to pUC19 digested with BamHI and Hind III. The 
ligation mixture was used to transform £ coli DH5aF\ The transformed E. coli cells 
were spread onto 10 agar plates using ampicillin selection. 

5 

1 .5 Screening of the A. oryzae genomic library. 

The libraries were screened using the filter colony hybridization method described 
by Gergen et al. (Nucleic Acids Res. 7:2115-2136, 1979). The probe that was used for 
the Southern blot was also used for the colony hybridization. Positive clones were 
10 isolated and confirmed by sequence analysis using sequencing primers designed from 
the sequences of the PDI fragments. One of the plasmids containing the desired 
fragment was termed pCaHj 425. 

1 .6 Sequence of the gene. 

15 The gene was sequenced using the Tag DyeDeoxy™ Terminator cycle 

sequencing kit supplied by Applied Biosystems following the manufacturer's 
instructions. The sequence reactions were run on an Applied Biosystems 373A DNA 
sequencer and the data were evaluated using the Macintosh computer program SegEd 
version 1.0 supplied by Applied Biosystems. 

20 The sequence of the A. oryzae PDI gene is shown in SEQ ID NO: 12 and the 

corresponding amino acid sequence is shown in SEQ ID NO: 13. 

Expression of a truncated form of the A. oryzae PDI gene. 

Construction of expression plasmids. 
25 The PDI gene of A. oryzae was truncated by introduction of a stop codon. This 
was done by PCR amplification of the PDI gene using a 5' PCR primer harbouring a 
BamH I site at its 5' end and a 3' primer corresponding to a truncation harbouring a 
Hind III site. 

The sequence of the 5' primer was: 
30 5' TTCGGATCCACCATGCGGACTTTCGCACC 3' 5205. 

„ The sequence of the 3' primers was: 
5' CCAAGCTTAGTGTTTCTCGGCGATGAACTT 3' 6314. 

Primer 6314 introduced a stop codon after aminoacid 281. 
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The expression plasmid were constructed by PCR amplification using primer 5205 
in combination with 6314 and pCaHj 425 as template using standard PCR conditions. 
The generated PCR segment was digested with BamH I and Hind III and inserted into 
pMTH 1560 (cf. WO 98/00529) digested with the same enzymes. The constructed 
5 plasmid was named pCaHj 445 (from primer 6314). 

c) Construction of the H61P mutated truncated PDI (expression plasmid pCaHi 548) : 
The truncated PDI of pCaHj 445 was inserted into pCaHj 527 by the digestion of 

pCaHj 445 by BamH I and Xho1, and ligating the PDI fragment to pCaHj 527 di- 
10 gested by the same enzymes. The ligation mixture was transformed into E. coli 

DB6507 as described above to form the plasmid pCaHj 545. 

pCaHj 545 was then used as template for a PCR reaction with the following 

primers: 

160712: CCC TTG CAA GGC TCT CGC TCC (SEQ ID NO:23) 

15 18699: TTG CCC TCATCC CCA TCC TTT (SEQ ID NO:24) 

160712 was phosphorylated in the 5' end. The primer covers the active site of PDI 
and alters Histidine in position 61 to proline. 

The formed PCR segment was digested by Xho I and ligated to the large frag- 
ment of pCaHj 545 being digested by Bal I and Xho I. The ligation mixture was used 
20 to transform E. coli DB6507 as described above to form the plasmid pCaHj 548 (cod- 
ing for amino acid sequence SEQ ID NO: 15 (the H61P mutated truncated PDI), thus 
leading to the polypeptide with an amino acid sequence 21-281 of SEQ ID NO:15. 
The construction of the plasmid is outlined in Figure 2. 

25 d) Transformation into Aspergillus orvzae: 

pCaHj 545 (comprising a nucleotide sequence coding for the amino acid sequence of 
amino acid number 1 to 281 of SEQ ID NO: 13) and pCaHj 548 (comprising SEQ ID 
' NO: 14 coding for the amino acid sequence SEQ ID NO: 15) were transformed into the 
protease weak Aspergillus oryzae strain JaL228 (WO 98/12300), fermented and re- 

30 covered as described in WO 95/00636. 

EXAMPLE 2: Treatment of a-lactalbumin with a protein variant of PDI 
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The following experiments were performed to establish a simple and alternative 
method for reducing the allergenicity of food allergens by the reduction of disulfide 
bonds. This was done by assessing the capacity of protein disulfide isomerases to 
destabilise a-lactalbumin and to increase the susceptibility of this protein to proteoly- 
5 sis in vitro. As a model substrate the cow's milk allergen a-lactalbumin having 4 disul- 
fide bonds was chosen. 

Materials and methods 

Substrate: 
10 250^1 1MK3P0 4) pH6.5 

250 jil [5, 10 or 20 mg/ml] a-lactalbumin 
10 ^l 100 mM p-mercapto-ethanol 

Enzyme solutions: 

PDI concentrations ranging from 0.04 - 4.00 mg/ml for: 
15 a) recombinant fungal protein disulfide isomerase #9601 12/BRNi, and 

b) DsbA 9412.206 22.06-94 BRNi/Bgra 

Trypsin stock solution [5 mg/ml] 
Reaction conditions: 

25 \x\ H 2 0 or enzyme solution was added to 25 \i\ substrate. Trypsin was added 
20 in a trypsin: a-lactalbumin ratio of 1:25. This reaction mixture was incubated 

overnight at 37 °C. 
Analysis conditions: 

20 ii\ Laemmli sample buffer, including 120 mM DTT was added to 20 ^il reaction 
mixture. After 10 min at 70 °C, 10 |il 1M iodoacetamide was added. Electropore- 
25 sis was performed at 125 V for 110 min on a 4-20% acrylamide gradient gel 
(NOVEX Pre-Cast Gels). Electroblotting to nitrocellulose membranes was per- 
formed at 30 V for 90 min. Protein staining was done as described in F-9503186. 

Results 

30 The results (Figure 5 and Figure 6) show that treatment of a-lactalbumin with a 
recombinant PDI, but not DsbA, destabilises a-lactalbumin and makes it more sus- 
ceptible to trypsin action under the current conditions for proteolysis. When 5 mg/ml 
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a-lactalbumin was incubated with 0.20 mg/ml PDI, all a-lactalbumin was digested by 
trypsin. 

Optimisation of the PDI: substrate' ratio reveals that all a-lactalbumin can be 
digested by trypsin when there is no more than a 50 X excess (in mg) as compared 
to PDL Accordingly the ratio is 1 mole of PDI (MW = 90 kDa) per 300 moles of cc- 
lactalbumin (MW= 15 kDa). 

From the above it can be concluded that recombinant fungal protein disulfide 
isomerase has a significant effect on the stability of the cow's milk allergen a- 
lactalbumin. 
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CLAIMS 

1. A polypeptide capable of reducing disulfide bonds, the polypeptide comprising an 
amino acid sequence selected from the group consisting of: 

(i) an amino acid sequence having at least 60% identity with the amino 
acid sequence set forth in SEQ ID NO:15 or amino acid number 21- 
281 of SEQ ID NO:15; 

(ii) an amino acid sequence having at least 60% similarity with the amino 
acid sequence set forth in SEQ ID NO: 15 or amino acid number 21- 
281 of SEQ IDNO:15; 

(iii) an amino acid sequence having at least 60% identity with the amino 
acid sequence set forth in SEQ ID NO: 17; and 

(iii) an amino acid sequence having at least 60% similarity with the amino 
acid sequence set forth in SEQ ID NO: 17; 
provided that a position in the polypeptide corresponding to amino acid residues 
numbered 58-61 in SEQ ID NO: 13 is selected from the group consisting of: 

i) Cys-Gly-Pro-Cys, 

ii) Cys-Ala-Thr-Cys, 

iii) Cys-Val-Leu-Cys, and 

iv) Cys-Gly-Tyr-Cys. 

2. The polypeptide of claim 1, wherein the comprised amino acid sequence has at 
least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at 
least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity or 
similarity with the amino acid sequence set forth in SEQ ID NO: 15, amino acid num- 
ber 21-281 of SEQ ID NO:15 or SEQ ID NO:17. 

3. The polypeptide according to claim 1 , which polypeptide has Cys-Gly-Pro-Cys in 
the position corresponding to amino acid residues numbered 58-61 in SEQ ID NO: 13. 

4. The polypeptide of claim 3, wherein the comprised amino acid sequence has at 
at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at 
least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity or 
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similarity with the amino acid sequence set forth in SEQ ID NO: 15, SEQ ID NO: 17 or 
amino acid number 21-281 of SEQ ID NO: 15. 

5. A polypeptide capable of reducing disulfide bonds, which polypeptide is encoded 
5 by a nucleotide sequence which hybridizes under low, medium or high stringency 
conditions with (i) SEQ ID NO: 14; (ii) SEQ ID NO: 16; (iii) a subsequence of (i) or (ii) 
of at least 100 nucleotides; or (iv) a complementary strand of (i), (ii) or (iii); provided 
that a position of the polypeptide corresponding to amino acid residues numbered 
58-61 in SEQ ID NO: 13 is selected from the group consisting of: 
10 i) Cys-Gly-Pro-Cys, 

ii) Cys-Ala-Thr-Cys, 

iii) Cys-Val-Leu-Cys, and 

iv) Cys-Gly-Tyr-Cys. 

15 6. The polypeptide according to claim 5, which polypeptide has Cys-Gly-Pro-Cys in 
the position corresponding to amino acid residues numbered 58-61 in SEQ ID NO:13. 

7. The polypeptide according to any of the preceding claims, wherein the comprised 
amino acid sequence is SEQ ID NO: 15 or SEQ ID NO: 17 or amino acid number 21- 

20 281 of SEQ ID NO: 15. 

8. The polypeptide according to any of the preceding claims, which polypeptide con- 
sists of the amino acid sequence SEQ ID NO:15 or SEQ ID NO:17 or amino acid 
number 21 -281 of SEQ ID NO: 15. 

25 

9. A polypeptide capable of reducing disulfide bonds, which polypeptide 

(i) is encoded by the nucleotide sequence contained in plasmid 
pCaHj548. 

(ii) comprises an amino acid sequence having at least 60%, at least 65%, at 
30 least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at 

least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 
100% identity or similarity with the amino acid sequence encoded by 
the nucleotide sequence contained in plasmid pCaHj548; or 



i 
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(iii) consist of an amino acid sequence having at least 60%, at least 65%, at 
least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at 
least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 
100% identity or similarity with the amino acid sequence encoded by 
5 the nucleotide sequence contained in plasmid pCaHj548, 

provided that a position of the polypeptide corresponding to amino acid residues 
numbered 58-61 in SEQ ID NO: 13 is selected from the group consisting of: 

i) Cys-Gly-Pro-Cys, 

ii) Cys-Ala-Thr-Cys, 

10 iii) Cys-Val-Leu-Cys, and 

iv) Cys-Gly-Tyr-Cys. 

10. A polypeptide capable of reducing disulfide bonds, the polypeptide consisting of 
an amino acid sequence having: 

15 (i) at least 60% identity or similarity with the amino acid sequence set 

forth in SEQ ID NO:15 or with amino acid number 21-281 of SEQ ID 
NO:15; or 

(ii) at least 60% identity or similarity with the amino acid sequence set 
forth in SEQ ID NO:17; 
20 provided that a position in the polypeptide corresponding to amino acid residues 
numbered 58-61 in SEQ ID NO: 13 is selected from the group consisting of: 

i) Cys-Gly-Pro-Cys, 

ii) Cys-Ala-Thr-Cys, 

iii) Cys-Val-Leu-Cys, and 
25 iv) Cys-Gly-Tyr-Cys. 

11. The polypeptide according to claim 10, which polypeptide has Cys-Gly-Pro-Cys in 
the position corresponding to amino acid residues numbered 58-61 in SEQ ID NO:13. 

30 12. The polypeptide of claim 10 or 1 1 , wherein the amino acid sequence has at least 
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% identity or simi- 
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larity with the amino acid sequence set forth in SEQ ID NO: 15 or SEQ ID NO: 17 or 
amino acid number 21-281 of SEQ ID NO:15. 

13. A polypeptide capable of reducing disulfide bonds, which polypeptide is a variant 
of a polypeptide having the amino acid sequence of SEQ ID NO: 15, SEQ ID NO: 17 
or amino acid number 21-281 of SEQ ID NO: 15; the variant comprising substitu- 
tions), deletion(s), and/or insertion(s) of one or more amino acids; provided that the 
variant comprises an amino acid sequence selected from the group consisting of: 

i) Cys-Gly-Pro-Cys, 

ii) Cys-Ala-Thr-Cys, 

iii) Cys-Val-Leu-Cys, and 

iv) Cys-Gly-Tyr-Cys, 
in a position corresponding to amino acid residues numbered 58-61 in SEQ ID 
NO:13. 

14. The polypeptide according to claim 13, which polypeptide has Cys-Gly-Pro-Cys 
in the position corresponding to amino acid residues numbered 58-61 in SEQ ID 
NO:13. 

20 15. The polypeptide of claim 13 or 14, having an amino acid sequence having 

(i) at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at 
least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at 
least 98%, or at least 99% identity or similarity with SEQ ID NO: 15 or 
SEQ ID NO:17 or amino acid number 21-281 of SEQ ID NO:15; or 

25 (ii) 100% similarity with SEQ ID NO: 15 or SEQ ID NO: 17 or amino acid 

number 21-281 of SEQ ID NO:15. 

16. The polypeptide according to any of the preceding claims further comprising an 
amino acid sequence which has at least 60%, at least 65%, at least 70%, at least 
30 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 
97%, at least 98%, at least 99% or 100% identity or similarity with the amino acid se- 
quence set forth in SEQ ID NO: 18 provided the position corresponding to amino acid 
residues numbered 31-34 in SEQ ID NO: 18 is selected from the group consisting of: 
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(i) Cys-Gly-Pro-Cys, 

(ii) Cys-Ala-Thr-Cys, 

(iii) Cys-Val-Leu-Cys, 

(iv) Cys-Gly-Tyr-Cys, and 
5 (v) Cys-Gly-His-Cys 



17. A polypeptide capable of reducing disulfide bonds, the polypeptide being a vari- 
ant of a parent protein disulfide isomerase by having an amino acid sequence differ- 
ing from that of the parent protein disulfide isomerase in a position corresponding to 

10 amino acid residues numbered 58-61 in SEQ ID NO: 13 , which position is Cys-Gly- 
Pro-Cys for the variant polypeptide. 

18. The polypeptide according to claim 17 having substitution(s), deletion(s), and/or 
insertion(s) of one or more amino acids compared to the parent protein disulfide 

15 isomerase, provided that the polypeptide comprises the amino acid sequence Cys- 
Gly-Pro-Cys in a position corresponding to amino acid residues numbered 58-61 in 
SEQ ID NO:13. 

19. The polypeptide of claim 18, wherein the amino acid sequence of the polypeptide 
20 has at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 

90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% identity or 
similarity with the amino acid sequence of the parent protein disulfide isomerase. 

20. The polypeptide of any of claims 17-19, wherein the parent protein disulfide 
25 isomerase is a PDI from A. oryzae or a subsequence thereof, SEQ ID NO: 13, or a 

subsequence of SEQ ID NO: 13 having protein disulfide isomerase activity, such as, 
e.g. amino acid no. 21-115 of SEQ ID NO: 13, e.g. amino acid no. 1-281 or 21-281 of 
SEQ ID NO: 13. 



30 21. The polypeptide of any of claims 17-20, wherein the parent protein disulfide 
isomerase is encoded by a nucleotide sequence which hybridises under low, medium 
or high stringency conditions with (i) SEQ ID NO:14 or SEQ ID NO:16; (ii) nucleotides 
131 to 365 of SEQ ID NO: 12); (iii) a subsequence of (i) or (ii) of at least 100 nucleo- 
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tides; or (iv) a complementary strand of (i), (ii) or (iii); provided that the position of the 
polypeptide corresponding to amino acid residues numbered 58-61 in SEQ ID NO: 13 
is Cys-Gly-Pro-Cys. 

22. A method for providing a polypeptide capable of reducing disulfide bond, the 
method comprising a step selected from the group consisting of: 

(a1) altering at least one amino acid residue in a parent protein disulfide isom- 
rease by amending the amino acid Xi to Glu and/or the amino acid Y2 to 

Pro in the site corresponding to Cys-X r Y 2 -Cys of the parent protein disul- 

« 

fide isomerase (PDI), 

to obtain a variant comprising Cys-Gly-Pro-Cys; 
(a2) altering at least one amino acid residue in a parent protein disulfide isom- 
rease by amending the amino acid X1 to Ala and/or the amino acid Y2 to 
Thr in a site corresponding to Cys-X r Y 2 -Cys of the parent protein disulfide 
isomerase (PDI), 

to obtain a variant comprising Cys-Ala-Thr-Cys; 
(a3) altering at least one amino acid residue in a parent protein disulfide isom- 
rease by amending the amino acid X1 to Val and/or the amino acid Y2 to 
Leu in the site corresponding to Cys-X r Y 2 -Cys of the parent protein disul- 
fide isomerase (PDI), 

to obtain a variant comprising Cys-Val-Leu-Cys; and 
(a4) altering at least one amino acid residue in a parent protein disulfide isom- 
rease by amending the amino acid X1 to Gly and/or the amino acid Y 2 to 
Tyr in the site corresponding to Cys-X r Y 2 -Cys of the parent protein disul- 
fide isomerase (PDI), 

to obtain a variant comprising Cys-Gly-Tyr-Cys. 

23. The method according to claim 22, comprising the step (a1 ) and wherein Cys- 
Gly-Pro-Cys is in a position corresponding to amino acid number 58-61 of SEQ ID 
NO:13. 

24. The method of claim 23, further comprising the step (b) of substitution(s), dele- 
tion^), and/or insertion(s) of one or more amino acids compared to the parent protein 
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disulfide isomerase, provided that the produced polypeptide comprises the amino 
acid sequence Cys-GIy-Pro-Cys in a position corresponding to amino acid residues 
numbered 58-61 in SEQ ID NO: 13. 

5 25. The method of any of claims 22-24, wherein the polypeptide has an amino acid 
sequence which has at least 60%, at least 65%, at least 70%, at least 75%, at least 
80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 
98%, at least 99% identity or similarity with the amino acid sequence of the parent 
protein disulfide isomerase. 

10 

26. A polypeptide capable of reducing disulfide bonds obtainable by the method of 
any of claims 22-25. 

27. A fusion polypeptide comprising a polypeptide according to any of the preceding 
15 claims and a fusion partner. 

28. The fusion polypeptide according to claim 27, wherein said other protein fragment 
is capable of facilitating expression and purification of the polypeptide. 

20 29. The fusion polypeptide according to claim 28, wherein the fusion partner is amino 
acid number 1-20 of SEQ ID NO: 15. 

30. The polypeptide according to any of the preceding claims capable of being ex- 
pressed extracellularly in a microbial host organism. 

25 

31. The polypeptide according to any of claims 1-21 or any of claims 26-30 having a 
redox potential which is decreased with at least 10mV, at least 20mV, at least 30mV 
compared to the protein disulfide isomerase having the amino acid sequence SEQ ID 
NO: 13 or amino acid number 21-281 of SEQ ID NO: 13. 

30 

32. The polypeptide according to any claims 1-21 or any of claims 26-30 having a 
redoxpotential of less than -200mV. 
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33. The polypeptide according to any of the claims 17-20 or 26 having a redox poten- 
tial which is lower than that of the parent disulfide isomerase. 

34. The polypeptide according to claim 33 having a redox potential which is at least 
30 mV lower as compared to the redoxpotential of the parent protein disulfide isom- 
erase. 

35. The polypeptide according to any of claims 1-21 or 26-30 having a redoxpotential 
which is lower than that of (i) SEQ ID NO: 13 without amino acid number 1-20 or (ii) 
the amino acid sequence given by amino acid number 21-281 of SEQ ID NO:13. 

36. The polypeptide according to any of the preceding claims, which polypeptide is a 
protein disulfide isomerase (PDI, EC 5.3.4.1). 

37. A polypeptide which is a protein disulfide isomerase variant having increased re- 
ducing properties when compared to the wild-type protein, with the formula of: 

X - A - [Z - B]„ - Y, wherein 

A is the amino acid sequence: Cys-Gly-Pro-Cys, 

X is an amino acid sequence comprising a sequence corresponding to SEQ ID 
NO:1 or a functional equivalent thereof, 

Y is an amino acid sequence comprising a sequence corresponding to SEQ ID 
NO:2 or a functional equivalent thereof, 

Z is an amino acid sequence comprising a sequence corresponding to SEQ ID 
NO: 11 or a functional equivalent thereof. 

B is individually selected from the amino acid sequence Cys-Gly-Pro-Cys or 
Cys-Gly-His-Gly or Cys-Ala-Thr-Cys or Cys-Pro-His-Cys or Cys-Val-Leu-Cys 
or Cys-Gly-Tyr-Cys, 
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n = 0 or an integer from 1 to 3 

38. The protein variant according to claim 37, wherein Y is an amino acid sequence 
5 selected from the group consisting of SEQ ID NO:3; SEQ ID NO:4; SEQ ID NO:5; 

SEQ ID IMO:6; SEQ ID NO:7; SEQ ID NO:8; SEQ ID NO:9; and SEQ ID NO:10. 

39. The polypeptide according to claim 37 or 38, wherein n is 1. 
10 40. The polypeptide according to claim 37 or 38, wherein n is 0. 

41 . The polypeptide according to claim 40, which polypeptide has an amino acid se- 
quence having at least 60%, at least 65%, least 70%, at least 75%, at least 80%, at 
least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at 

15 least 99% identity or similarity with the amino acid sequence shown as amino acids 
21-281 of SEQ ID NO:13. 

42. A polypeptide which is a protein disulfide isomerase having an amino acid se- 
quence which in a position corresponding to amino acid residues numbered 58-61 in 

20 SEQ ID NO: 13 is Cys-Gly-Pro-Cys. 

43. A nucleic acid comprising a nucleotide sequence encoding the polypeptide of any 
of the preceding claims. 

25 44. The nucleic acid according to claim 43 which is DNA. 

45. A nucleotide sequence encoding a polypeptide as defined in any of the preceding 
claims. 

30 46. A nucleic acid construct comprising the nucleotide sequence of claim 45 operably 
linked to one or more control sequences that direct the production of the polypeptide 
in a suitable expression host. 
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47. A vector comprising the nucleotide sequence of claim 45. 

48. The vector according to claim 47 which is a recombinant expression vector. 
5 49. The vector according to claim 47 or 48 which is a plasmid. 

50. A transformed cell harbouring the vector of any of claims 47-49. 

51. The transformed cell according to claim 50 capable of expressing the polypep- 
10 tide. 

52. A recombinant host cell comprising the nucleic acid construct of claim 46, which 
cell is capable of expressing the encoded polypeptide. 

15 53. The cell according to any of claims 50-52, which is a microbial cell 

54. The cell according to claim 53 which is a bacterial or a fungal cell. 

55. The cell according to claim 54, which cell is a gram-positive bacterium, e.g. of the 
20 genus Bacillus or Streptomyces or a cell of a gram-negative bacterium, e.g. of the ge- 
nus Escherichia. 

56. The cell according to claim 54, which cell is a yeast cell, e.g. of the genus Sac- 
charomyces, or a cell of a filamentous fungus, e.g. of the genus Aspergillus or Fu- 

25 sarium. 

57. The cell according to claim 56, which cell is an Aspergillus species selected from 
the group consisting of A oryzae, A. niger and A. nidulans. 

30 58. A method of producing a polypeptide according to any of claims 1-21 or 26-42, 
wherein a host cell containing an expression cassette comprising a nucleotide se- 
quence encoding the polypeptide is cultured in a suitable medium under conditions 
promoting the expression of the protein. 
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59. A method of producing a polypeptide according to any of claims 1-21 or 26-42, 
comprising inserting a nucleic acid according to claim 40 into a vector which is able to 
replicate in a host cell, introducing the resulting recombinant vector into the host cell, 

5 culturing the host cell in a culture medium under conditions sufficient to effect ex- 
pression of the polypeptide, and recovering the polypeptide from the host cell or the 
culture medium; or synthesising the polypeptide by solid or liquid phase synthesis. 

60. A method of producing the polypeptide of any of claims 1-21 or 26-42, which 
10 method comprises the step of (i) cultivating the cell of any of claims 47-54 so as to 

express and optionally secrete the polypeptide; and (ii) recovering the polypeptide. 

61 . The method according to any of claims 58-60, in which the polypeptide is ex- 
pressed in the form of a proenzyme and the cell is cultured in the presence of a pro- 

15 teolytic enzyme capable of converting the proenzyme of the polypeptide into the ma- 
ture polypeptide. 

62. Use of a polypeptide according to any of claims 1-21 or 26-42 for reducing the 
allergenicity of a protein. 

20 

63. Use of a polypeptide according to any of claims 1-21 or 26-42 in food or feed 
manufacturing. 

64. Use of a polypeptide according to any of claims 1-21 or 26-42 in the preparation 
25 of an enzyme preparation for use in food or feed manufacturing. 

65. The use according to claim 63 for the reduction of allergens in food or feed, e.g. 
gluten or milk based products, including beverages, such as infant formula and die- 
tary drinks. 

30 

66. Use according to claim 63 for increasing the digestibility of food or feed, such as, 
e.g., for increasing the digestibility of milk or wheat based food or feed products. 
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67. Use of a polypeptide according to any of claims 1-21 or 26-42 in the manufactur- 
ing of a cosmetic product. 

68. Use of a polypeptide according to any of claims 1-21 or 26-42 treating scleropro- 
5 teins, such as e.g. human or animal hair, cleaning fabrics. 

69. A composition comprising a polypeptide as defined in any of the claims 1-21 or 
26-42. 

10 70. The composition according to claim 69 additionally comprising at least another 
enzyme, such as a protease, an amylase, a lipase, a hydrolase, a peroxidase, a cel- 
lulase, a transglutaminase, a glucose oxidase, a xylanase, a pectin methyl esterase 
or a pectin lyase. 

15 71. A food additive comprising a polypeptide as defined in any of claims 1-21 or 26- 
42. 

72. A cosmetic comprising a polypeptide as defined in any of claims 1-21 or 26-42. 
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Figure 1: Construction of pCaHj 527: 
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Figure 2: Construction of pCaHj 548 
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Figure 3 
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Figure 4 





T. re- 
seii 


A. 

oryzae 


H. inso- 
lens 


S. cere- 
vis iae 


A, niger 


T. reseii 




58.6 


71.7 


35.9 


56.6 


A. oryzae 


55.7 




59.8 


33.6 


77.9 


H, insolens 


34.1 


54.4 


*** 


36.6 


58.4 


S. cere- 
visiae 


104.5 


114.4 


102.6 


*** 


33.8 


A. niger 


58.7 


23.8 


58.6 


112.2 


++* 




T. re- 
seii 


A. 

oryzae 


H. inso- 
lens 


S. cerevi- 
siae 


A. niger 





A ORYZAE . PRO 


(1) 




T_RESEII.PRO 


(1) 




H_INSOLENS . PRO 


(1) 




CEREVISIAE.PRO 


(1) 




A_NIGER.PRO 


(1) 




Consensus 


(1) 




A ORYZAE. PRO 


(48) 




T RESEII. PRO 


(41) 




H INSOLENS. PRO 


, (40) 


s m 


CEREVISIAE.PRO 


(51) 




A_NIGER. PRO 


(44) 




Consensus 


(51) 




A ORYZAE. PRO 


(98) 




T RESEII. PRO 


(91) 




H INSOLENS. PRO 


(90) 


s m 


CEREVISIAE.PRO 


(101) 




A_NIGER. PRO 


(94) 




Consensus 


(101) 




A ORYZAE. PRO 


(146) 




T RESEII. PRO 


(139) 




H_INSOLENS . PRO 


(138) 


s m 


CEREVISIAE.PRO 


(149) 




A_NIGER. PRO 


(142) 




Consensus 


(151) 




A ORYZAE. PRO 


(194) 




T_RESEII.PRO 


(187) 




H_INSOLENS . PRO 


(186) 


s 


CEREVISIAE.PRO 


(199) 




A_NIGER. PRO 


(190) 




Consensus 


(201) 



1 50 
PrFAPHLS ilf|SAVA|AA|ATAEAP|D|VSiT^PTfiKEHD 

Pkaqi^alg— |i^--aawa |d|vJk^»i^d 

|S |l^SAW|AAiTE- |l»|DQir-pSF^EHG 

LLAASAV AAAD 



SSFAP! 
MR A WILS 
51 

8 



SDWSL DTFEDFINSHD 

_100 

EE] 






§SEVTK||N 
IWA — D 



LVLAEFFAPWCGHCKALAPEYEEAATTLKEKNI LAKVDCTEE DLCREH 
101 150 

t~GLDAVKPi^AiQTE?^VS , 



SPVT 

saIt 



GV^YfrgpiR-- 

GV^|T^|R--GLDNVSP|KiQlKAAAiTS 
NlP^^IplNSDiNNSIDiElPlT. 
GV^t^^IiR--^SSKP|^^QTE^|VSYil 
GVEGYPTLKI FR GLD V PY G R AEAIVSYMIKQSLPAVS VT DN 
151 200 
lEEI KTMDKllpSYIAS D~lQTA^SllTTEfESQRDl#^^^S 



HsDi 





Sm**** H» 





pP^^^^ ^W^ 1 ^^iP S GKI pAD FNATpYS^NKH FNDpI>||V||AEl 
f|EEI KTMDK^^^^YI P S D — gQE^YQ^EKY^SQRDN^Ir| ia5W ^ ac ~ " 
LEEFKT DKIVLIAYI SD D SNE FT LAE RDNYLFAAT DAAI 
201 250 

AKAEGVKQlS I VLYKDFDEKKAT YDGE 1 EQ§A|iLSl 

AEAEGVKS|blVLYKSFDEGKNV|SEK FfAEAIRNiAQVAAT-^ 

AEAEGVKAg|§I VL YKDFDEGKAVgS EK FEVEAI E] 

DFKLS1yl|sa| |EPV\^GKKADIAiA|VFEKWiQVEAL| 

ak§egveq!s i vlykdfdekkavydge 1 eqeai hsw|k§^t§lv| 

a aegvk psivlykdfde kavy gk e eai wvktaatplvg 




WO 00/70064 



PCT7DK00/00265 



5/8 



Figure 4 (continued) 



A_ORYZAE.PRO 
T_RESEII.PRO 
H_INSOLENS . PRO 
S_CEREVISIAE.PRO 
AJMIGER. PRO 
Consensus 

A_ORYZAE . PRO 
T_RE5EII.PRO 
H_INSOLENS . PRO 
S_CEREVTSIAE. PRO 
A_NIGER.PRO 
Consensus 

A_ORY ZAE . PRO 
T_RESEII.PRO 
H_INSOLENS.PRO 
S_CEREVI S IAE . PRO 
A_NIGER.PRO 
Consensus 

A_ORYZAE.PRO 
T_RESEII.PRO 
H_INSOLENS . PRO 
S_CEREVISIAE. PRO 
A_NIGER. PRO 
Consensus 

A ORYZAE.PRO 
T~RESEII.PRO 
H_INSOLENS . PRO 
S_CEREVTSIAE.PRO 
A_NIGER. PRO 
Consensus 

A_ORYZAE . PRO 
T RESEII.PRO 
H_INSOLENS . PRO 
S_CEREVISIAE.PRO 
A_NIGER. PRO 
Consensus 



240 
233 
232 
243 
236 
251 

290 
283 
282 
293 
286 
301 

331 
324 
323 
342 
327 
351 

379 
372 
371 
392 
375 
401 

425 
421 
420 
439 
425 
451 

474 
470 
469 
489 
474 
501 



251 300 

ilGPETYSG§I|A!llf^^ 




1IGPETYS 

- - - 'W n b >-V"V UP-1.TI V t * "I -a T -. -ii > > • '= » - » » " " " 

ILEE^KPLFT 

EI GPETYSGYI SAGI PLAYI FAET EERE YTE FKPIAEKHKG INFAT 
301 350 
HKLliAmH*DPSI«»DPEKNA£ -P|fE- 

Pf^«W"f«i#HI PEf^KE- 

»AFiA^lfLKi|^|A|^^« *W E *%;, 

TKFiRiMlliK-i^I^I^EDIpGLPQLSEEA^LSiK 

|A|^QDPAKNAiY *&Q$KE- 

IDAK FGAHAGNLNLKTDKFPAFAIQDI KN KY PFDQSKE 
351 400 
[IGKEfQDVLDDiqpPS XSEAX PHiEGPpTWf§AHSY 
:i^IAiFVDGFSSGKiEASI^EPIP^^EGP5iHrVV3KAHSYKDI 

|tfeaik§fvddf^agk§eps I 

(LVKDFLKGDASPI^QEIE#I^SSiFQ^§KNiIlI 

:PSI^EPlP»:GPpV\^AHSYKDS 
I AKDI AFV DFL GKIEPSIKSEPIPETQEGPVTVVVAHSYKDIVL 
401 4S0 

iiiKDiFi — m 





Ml^SMVAW 



' V%<' >v{%<V<%rtriwhvrtvi>W JMWWM >>^WV<<V'.J. 

SP Kte$#2^ 




D KDVLI E FYAPWC GHC KALAPKYD ELAALYA SD KWIAKIDAT 
451 500 

isfTVEDLANpi^ill^KV 
Hi KiYPA|D|KNPVTf S^TVEDFI E3 

ASPDE-l^^^YPAiAiGQPVTlslstrVEDLIlllAS 1 -. 

' - - * - - '■- ""■"•»«SLF0^II^iHFDV 

A»DP-|TiF^pPAiAiDSPiEiSlSirVEDI^I«I#^ 
ANDVPD I GFPTIKLYPAGAK PV YSGSRTVEDL FIKENGKHKV 
501 542 
DAiEjDPKKEQE^^ATETRAASDETETPAjTS] 

KiSEEAKA|EET 




# 



JT 



,TfT 



i Si 



;EETQE( 



.TEAETPAiTD^ 




DAL V AE 



AGE TEA AS AE E AA DE A HDEL 



WO 00/70064 



PCT/DK00/00265 



6/8 



Figure 5 




1. 5mg/ml a-lactalbumin (aLA) + trypsin 

2. aLA + 0.04 mg/m! PDI + trypsin 

3. aLA + 0.20 mg/ml PDI + trypsin 

4. aLA + 1.00 mg/ml PDI + trypsin 

5. aLA + 4.00 mg/ml PDI + trypsin 

6. markers 

7. aLA + 0.04 mg/ml DsbA + trypsin 

8. aLA + 0.20 mg/ml DsbA + trypsin 

9. aLA + 1.00 mg/ml DsbA + trypsin 
0. aLA + 4.00 mg/ml DsbA + trypsin 

Incubation was overnight at 37 °C. 
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Figure 6 




1. 10 mg/ml aLA + 0.08 mg/ml PDI + trypsin 

2. aLA + 0.20 mg/ml PDI + trypsin 

3. aLA + 0.40 mg/ml PDI + trypsin 

4. 20 mg/ml aLA + 0.16 mg/ml PDI + trypsin 
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Figure 7 
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Figure 7 (continued) 



Pdi 
Pdi* 
Pdi" 
Pdi 
Pdi" 
Pdi" 
Pdi 



_Mouse 
_Rat 
JBovin 
_Human 
_Rabit 
[chick 
Yeast 



301 

LFIFIDSDHT 
LFIFIDSDHT 
LFIFIDSDHT 
LFIFIDSDHT 
LFIFIDSDHA 
LFIFIDSDHS 
LMNFVSIDAR 



DNQRILEFFG 
DNQRILEFFG 
DNQRILEFFG 
DNQRILEFFG 
DNQRILEFFG 
DNQRILEFFG 
KFGRHAGNLN 



350 

LKKEECPAVR LITLEEEM TKY 

LKKEECPAVR LITLEEEM TKY 

LKKEECPAVR LITLEEEM TKY 

LKKEECPAVR LITLEEEM TKY 

LKKEECPAVR LITLEEEM TKY 

LKKEECPAVR LITLEEEM TKY 

M.KEQFPLFA IHDMTEDLKY GLPQLSEEAF 



Pdi 

Pdi" 

Pdi' 

Pdi 

Pdi 

Pdi 

Pdi 



_Mouse 

_Rat 

JBovin 

_Human 

Rabit 

[chick 

Yeast 



351 

KPESDELTAE 
KPESDELTAE 
KPESDELTAE 
KPESEELTAE 
KPESDELTAE 
KPESDDLTAD 
DELSDKIVLE 



K. . ITEFCHR 
K. - ITQFCHH 
K. .ITEFCHR 
R. .ITEFCHR 
G. . ITEFCQR 
K. . IKEFCNK 
SKAIESLVKD 



FLEGKIKPHL 

FLEGKIKPHL 

FLEGKIKPHL 

FLEGKIKPHL 

FLEGKIKPHL 

FLEGKIKPHL* 

FLKGDASPIV 



MSQEVPEDWD 
MSQELPEDWD 
MSQELPDDWD 
MSQERAGDWD 
MSQELPEDWD 
MSQDLPEDWD 
KSQEIFENQD 



400 

KQPVKVLVGA 
KQPVKVLVGK 
KQPVKVLVGK 
KQPVKVPVGK 
RQPVKVLVGK 
KQPVKVLVGK 
S . SVFQLVGK 



Pdi 

Pdi" 

Pdi" 

Pdi 

Pdi 

Pdi 

Pdi 



_Mouse 
_Rat 
_Bovin 
_Human 
_Rabit 
Chick 
Yeast 



401 

NFEEVAFDEK 
NFEEVAFDEK 
NFEEVAFDEK 
NFEDVAFDEK 
NFEEVAFDEK 
NFEEVAFDEN 
NHDEIVNDPK 



KNVFVEFYAP 
KNVFVEFYAP 
KNVFVEFYAP 
KNVFVEFYAP 
KNVFVEFYAP 
KNVFVEFYAP 
KDVLVLYYAP 



WCGHCKQLAP 
WCGHCKQLAP 
WCGHCKQLAP 
WCGHCKQLAP 
WCGHCKQLAP 
WCGHCKQLAP 
WCGHCKRLAP 



IWDKLGETY. 
IWDKLGETY. 
IWDKLGETY . 
IWDKLGETY . 
IWDKLGETY. 
IWDKLGETY . 
TYQELADTYA 



450 

KDHENIIIAK 
KDHENIVIAK 
KDHENIVIAK 
KDHENIVIAK 
KEHQDIVIAK 
RDHENIVIAK 
NATSDVLIAK 



Pdi 
Pdi 
Pdi 
Pdi 
Pdi 
Pdi 
Pdi 



Mouse 

Rat 

JBovin 

Jiuman 

_Rabit 

[chick 

Yeast 



451 

MDSTANEVEA 
MDSTANEVEA 
MDSTANEVEA 
MDSTANEVEA 
MDSTANEVEA 
MDSTANEVEA 
LDHTENDVRG 



VKVHSFPTLK 
VKVHSFPTLK 
VKVHSFPTLK 
VKVHSFPTLK 
VKVHSFPTLK 
VKIHSFPTLK 
WIEGYPTIV 



FFPASADRTV 
FFPASADRTV 
FFPASADRTV 
FFPASADRTV 
FFPAGPGRTV 
FFPAGSGRNV 
LYPGGKKSES 



IDYNGERTLD 
IDYNGERTLD 
IDYNGERTLD 
IDYNGERTLD 
IDYNGERTLD 
IDYNGERTLE 
WYQGSRSLD 



500 

GFKKFLESGG 
GFKKFLESGG 
GFKKFLESGG 
GFKKFLESGG 
GFKKFLESGG 
GFKKFLESGG 
SLFDFIKENG 



Pdi 
Pdi 
Pdi 
Pdi 
Pdi 
Pdi 
Pdi 



Mouse 

Rat 

Bovin 

_Human 

Rabit 

[chick 

Yeast 



501 

QDGAGDDEDL 
QDGAGDNDDL 
QDGAGDDDDL 
QDGAGDDDDL 
QDGAGDEDGL 
QDGAAADDDL 
HFDVDGKALY 



. DLEE . .ALE 
. DLEE . .ALE 
EDLEE. .AEE 
EDLEE . . AEE 
EDLEE . .AEE 
EDLET. .DEE 
EEAQEKAAEE 



PDMEE . . DDD 
PDMEE. .DDD 
PDLEE . . DDD 
PDMEE . . DDD 
PDLEE . . DDD 
TDLEEGDDDE 
ADADAELADE 



538 
QKAVKDEL 
QKAVKDEL 
QKAVKDEL 
QKAVKDEL 
QKAVRDEL 
QKIQKDEL 
EDAIHDEL 



WO 00/70064 



PCT/DKOO/00265 



SEQUENCE LISTING 

<110> NOVO NORDISK A/S 

<120> Novel Protein Disulfide Isomerase variants 
<130> 5896A 

<140> 
<141> 

<160> 24 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 57 
<212> PRT 

<213> Aspergillus oryzae 
<400> 1 

Met Arg Thr Phe Ala Pro Trp lie Leu Ser Leu Leu Gly Ala Ser Ala 
15 10 15 

Val Ala Ser Ala Ala Asp Ala Thr Ala Glu Ala Pro Ser Asp Val Val 

20 25 30 

Ser Leu Thr Gly Asp Thr Phe Glu Thr Phe Val Lys Glu His Asp Leu 
35 40 45 

Val Leu Ala Glu Phe Phe Ala Pro Trp 
50 55 



<210> 2 
<211> 454 
<212> PRT 

<213> Aspergillus oryzae 
<400> 2 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg 

1 
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35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

He Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro Glu Asn Leu Glu Glu He Lys Thr Met Asp Lys He Val Val 

85 90 95 

He Gly Tyr He Ala Ser Asp Asp Gin Thr Ala Asn Asp He Phe Thr 

100 105 110 

Thr Phe Ala Glu Ser Gin Arg Asp Asn Tyr Leu Phe Ala Ala Thr Ser 
115 120 125 

Asp Ala Ser He Ala Lys Ala Glu Gly Val Lys Gin Pro Ser He Val 
130 135 140 

Leu Tyr Lys Asp Phe Asp Glu Lys Lys Ala Thr Tyr Asp Gly Glu He 
145 150 155 160 

Glu Gin Asp Ala Leu Leu Ser Trp Val Lys Thr Ala Ser Thr Pro Leu 

165 170 175 

Val Gly Glu Leu Gly Pro Glu Thr Tyr Ser Gly Tyr He Thr Ala Gly 

180 185 190 

He Pro Leu Ala Tyr He Phe Ala Glu Thr Lys Glu Glu Arg Glu Gin 
195 200 205 

Phe Thr Glu Glu Phe Lys Phe He Ala Glu Lys His Lys Gly Ser He 
210 215 220 

Asn He Val Thr He Asp Ala Lys Leu Tyr Gly Ala His Ala Gly Asn 
225 230 235 240 

Leu Asn Leu Asp Pro Ser Lys Phe Pro Ala Phe Ala He Gin Asp Pro 

245 250 255 

Glu Lys Asn Ala Lys Tyr Pro Tyr Asp Gin Ser Lys Glu Val Lys Ala 

260 265 270 

Lys Asp He Gly Lys Phe He Gin Asp Val Leu Asp Asp Lys Val Glu 
275 280 285 

Pro Ser He Lys Ser Glu Ala He Pro Glu Thr Gin Glu Gly Pro Val 
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290 

Thr Val Val Val Ala 
305 

Lys Asp Val Leu Leu 

325 

Ala Leu Ala Pro Lys 

340 

Pro Glu Val Thr lie 
355 

Asp Ser lie Thr Gly 
370 

Lys Asp Ser Pro Val 
385 

Ala Asn Phe Val Lys 

405 

Val Asp Pro Lys Lys 

420 

Ala Ala Ser Asp Glu 
435 

Ser Glu His Asp Glu 
450 



295 

His Ser Tyr Lys Asp Leu 
310 315 

Glu Phe Tyr Ala Pro Trp 

330 

Tyr Glu Glu Leu Ala Ser 

345 

Ala Lys He Asp Ala Thr 
360 

Phe Pro Thr He Lys Leu 
375 

Glu Tyr Glu Gly Ser Arg 
390 395 

Glu Asn Gly Lys His Lys 

410 

Glu Gin Glu Ser Gly Asp 

425 

Thr Glu Thr Pro Ala Ala 
440 

Leu 



300 

Val Leu Asp Asn Glu 

320 

cys Gly His Cys Lys 

335 

Leu Tyr Lys Asp He 
350 

Ala Asn Asp Val Pro 
365 

Phe Ala Ala Gly Ala 
380 

Thr Val Glu Asp Leu 

400 

Val Asp Ala Leu Glu 

415 

Ala Thr Glu Thr Arg 
430 

Thr Ser Asp Asp Lys 
445 



<210> 3 
<211> 54 
<212> PRT 

<213> Aspergillus oryzae 
<400> 3 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn He Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys He Phe Arg 
35 40 45 
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Gly Leu Asp Ala Val Lys 
50 



<210> 4 
<211> 70 
<212> PRT 

<213> Aspergillus oryzae 
<400> 4 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn He Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys He Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

He Val Ser Tyr Met Val 
65 70 



<210> 5 
<211> 80 
<212> PRT 

<213> Aspergillus oryzae 
<400> 5 

'Lys Ala Leu Ala Pro Lys Tyr Glu 
1 5 

Lys Asn He Pro Leu Val Lys Val 

20 

Cys Arg Asp Gin Gly Val Glu Gly 

35 40 

Gly Leu Asp Ala Val Lys Pro Tyr 
50 55 

He Val Ser Tyr Met Val Lys Gin 



Gin Ala Ala Thr Glu Leu Lys Glu 

10 15 

Asp Cys Thr Glu Glu Glu Ala Leu 
25 30 

Tyr Pro Thr Leu Lys He Phe Arg 

45 

Gin Gly Ala Arg Gin Thr Glu Ala 

60 

Ser Leu Pro Ala Val Ser Pro Val 
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65 



70 



75 



80 



<210> 6 
<211> 82 
<212> PRT 

<213> Aspergillus oryzae 
<400> 6 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

lie Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro 



<210> 7 
<211> 102 
<212> PRT 

<213> Aspergillus oryzae 
<400> 7 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg 
35 40 45 
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Gly Leu Asp Ala Val 
50 

He Val Ser Tyr Met 
65 

Thr Pro Glu Asn Leu 

85 

He Gly Tyr He Ala 

100 



Lys Pro Tyr Gin Gly Ala 
55 

Val Lys Gin Ser Leu Pro 
70 75 

Glu Glu He Lys Thr Met 

90 

Ser 



Arg Gin Thr Glu Ala 
60 

Ala Val Ser Pro Val 

80 

Asp Lys He Val Val 

95 



<210> 8 
<211> 113 
<212> PRT 

<213> Aspergillus oryzae 
<400> 8 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
1 5 10 15 

Lys Asn He Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys He Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

He Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro Glu Asn Leu Glu Glu He Lys Thr Met Asp Lys He Val Val 

85 90 95 

He Gly Tyr He Ala Ser Asp Asp Gin Thr Ala Asn Asp lie Phe Thr 

100 105 110 

Thr 



<210> 9 
<211> 220 
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<212> PRT 

<213> Aspergillus oryzae 
<400> 9 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
1 5 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys He Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

He Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro Glu Asn Leu Glu Glu He Lys Thr Met Asp Lys He Val Val 

85 90 95 

He Gly Tyr He Ala Ser Asp Asp Gin Thr Ala Asn Asp He Phe Thr 

100 105 HO 

Thr Phe Ala Glu Ser Gin Arg Asp Asn Tyr Leu Phe Ala Ala Thr Ser 
115 120 125 

Asp Ala Ser He Ala Lys Ala Glu Gly Val Lys Gin Pro Ser He Val 
130 135 140 

Leu Tyr Lys Asp Phe Asp Glu Lys Lys Ala Thr Tyr Asp Gly Glu He 
145 150 155 160 

Glu Gin Asp Ala Leu Leu Ser Trp Val Lys Thr Ala Ser Thr Pro Leu 

165 170 175 

Val Gly Glu Leu Gly Pro Glu Thr Tyr Ser Gly Tyr He Thr Ala Gly 

180 185 190 

He Pro Leu Ala Tyr He Phe Ala Glu Thr Lys Glu Glu Arg Glu Gin 
195 200 205 

Phe Thr Glu Glu Phe Lys Phe He Ala Glu Lys His 
210 215 220 
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<210> 10 
<211> 450 
<212> PRT 

<213> Aspergillus oryzae 
<400> 10 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
1 5 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

He Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro Glu Asn Leu Glu Glu He Lys Thr Met Asp Lys He Val Val 

85 90 95 

He Gly Tyr He Ala Ser Asp Asp Gin Thr Ala Asn Asp lie Phe Thr 

100 105 110 

Thr Phe Ala Glu Ser Gin Arg Asp Asn Tyr Leu Phe Ala Ala Thr Ser 
115 120 125 

Asp Ala Ser He Ala Lys Ala Glu Gly Val Lys Gin Pro Ser He Val 
130 135 140 

Leu Tyr Lys Asp Phe Asp Glu Lys Lys Ala Thr Tyr Asp Gly Glu He 
145 150 155 160 

Glu Gin Asp Ala Leu Leu Ser Trp Val Lys Thr Ala Ser Thr Pro Leu 

165 170 175 

Val Gly Glu Leu Gly Pro Glu Thr Tyr Ser Gly Tyr He Thr Ala Gly 

180 185 190 

He Pro Leu Ala Tyr He Phe Ala Glu Thr Lys Glu Glu Arg Glu Gin 
195 200 205 

Phe Thr Glu Glu Phe Lys Phe He Ala Glu Lys His Lys Gly Ser He 
210 215 220 
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Asn lie Val Thr He Asp Ala Lys Leu Tyr Gly Ala His Ala Gly Asn 
225 230 235 240 

Leu Asn Leu Asp Pro Ser Lys Phe Pro Ala Phe Ala He Gin Asp Pro 

245 250 255 

Glu Lys Asn Ala Lys Tyr Pro Tyr Asp Gin Ser Lys Glu Val Lys Ala 

260 265 270 

Lys Asp He Gly Lys Phe He Gin Asp Val Leu Asp Asp Lys Val Glu 
275 280 285 

Pro Ser He Lys Ser Glu Ala He Pro Glu Thr Gin Glu Gly Pro Val 
290 295 300 

Thr Val Val Val Ala His Ser Tyr Lys Asp Leu Val Leu Asp Asn Glu 
305 310 315 320 

Lys Asp Val Leu Leu Glu Phe Tyr Ala Pro Trp Cys Gly His Cys Lys 

325 330 335 

Ala Leu Ala Pro Lys Tyr Glu Glu Leu Ala Ser Leu Tyr Lys Asp He 

340 345 350 

Pro Glu Val Thr He Ala Lys He Asp Ala Thr Ala Asn Asp Val Pro 
355 360 365 

Asp Ser He Thr Gly Phe Pro Thr He Lys Leu Phe Ala Ala Gly Ala 
370 375 380 

Lys Asp Ser Pro Val Glu Tyr Glu Gly Ser Arg Thr Val Glu Asp Leu 
385 390 395 400 

Ala Asn Phe Val Lys Glu Asn Gly Lys His Lys Val Asp Ala Leu Glu 

405 410 415 

Val Asp Pro Lys Lys Glu Gin Glu Ser Gly Asp Ala Thr Glu Thr Arg 

420 425 430 

Ala Ala Ser Asp Glu Thr Glu Thr Pro Ala Ala Thr Ser Asp Asp Lys 
435 440 445 

Ser Glu 
450 



<210> 11 
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<211> 331 
<212> PRT 

<213> Aspergillus oryzae 
<400> 11 

Lys Ala Leu Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu 
15 10 15 

Lys Asn lie Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu 

20 25 30 

Cys Arg Asp Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg 
35 40 45 

Gly Leu Asp Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala 
50 55 60 

lie Val Ser Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val 
65 70 75 80 

Thr Pro Glu Asn Leu Glu Glu lie Lys Thr Met Asp Lys He Val Val 

85 90 95 

He Gly Tyr He Ala Ser Asp Asp Gin Thr Ala Asn Asp He Phe Thr 

100 105 110 

Thr Phe Ala Glu Ser Gin Arg Asp Asn Tyr Leu Phe Ala Ala Thr Ser 
115 120 125 

Asp Ala Ser He Ala Lys Ala Glu Gly Val Lys Gin Pro Ser He Val 
130 135 140 

Leu Tyr Lys Asp Phe Asp Glu Lys Lys Ala Thr Tyr Asp Gly Glu He 
145 150 155 160 

Glu Gin Asp Ala Leu Leu Ser Trp Val Lys Thr Ala Ser Thr Pro Leu 

165 170 175 

Val Gly Glu Leu Gly Pro Glu Thr Tyr Ser Gly Tyr He Thr Ala Gly 

180 185 190 

He Pro Leu Ala Tyr He Phe Ala Glu Thr Lys Glu Glu Arg Glu Gin 
195 200 205 

Phe Thr Glu Glu Phe Lys Phe lie Ala Glu Lys His Lys Gly Ser He 
210 215 220 

Asn He Val Thr lie Asp Ala Lys Leu Tyr Gly Ala His Ala Gly Asn 
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225 

Leu Asn Leu Asp 



Glu Lys Asn Ala 

260 

Lys Asp lie Gly 
275 

Pro Ser lie Lys 
290 

Thr Val Val Val 
305 

Lys Asp Val Leu 



230 

Pro Ser Lys Phe 
245 

Lys Tyr Pro Tyr 



Lys Phe lie Gin 

280 

Ser Glu Ala lie 
295 

Ala His Ser Tyr 
310 

Leu Glu Phe Tyr 
325 



235 

Pro Ala Phe Ala 
250 

Asp Gin Ser Lys 
265 

Asp Val Leu Asp 



Pro Glu Thr Gin 

300 

Lys Asp Leu Val 
315 

Ala Pro Trp 
330 



240 

lie Gin Asp Pro 
255 

Glu Val Lys Ala 
270 

Asp Lys Val Glu 
285 

Glu Gly Pro Val 



Leu Asp Asn Glu 

320 



<210> 12 
<211> 1953 
<212> DNA 

<213> Aspergillus oryzae 
<400> 12 

ccctgctgtc cccatagaca gtacacacgt catcctttga tattgtcaca cttgacaaat 60 

tcccgacacc atgcggactt tcgcaccttg gatcttgagc cttctagggg cttctgctgt 120 

agcttctgct gccgatgcga ctgccgaagc tccctccgat gtggtctcgc tcaccgggga 180 

cacattcgaa actttcgtca aggagcatga cctagttttg gccgagtttt ttgctccctg 240 

gtgtggccat tgcaaggctc tcgctccgaa atacgagcag gccgccactg agttaaagga 300 

aaagaacatt ccgctggtca aggttgattg caccgaggaa gaggctcttt gtagggacca 360 

aggtgttgaa ggttacccca cgctgaagat tttccgtggc cttgacgctg ttaagcctta 420 

tcagggagct cgtcagaccg aggcgtaagt gtcacctgtt tgttagcctt gctcaaataa 480 

tattgaccgc tagtatcata ggattgtttc atacatggtc aagcagtcac tacctgctgt 540 

gtcccctgtc accccagaaa acctcgaaga gatcaagact atggacaaga ttgtcgttat 600 

tggttatatc gcgtctgacg accagactgc caatgatata ttcaccactt ttgccgagtc 660 

acagagagac aactacctct tcgccgccac aagtgatgca tcgatcgcta aggcagaagg 720 

tgttaagcaa ccttcgattg ttctctataa agacttcgat gaaaagaaag ctacttatga 780 

tggagagatt gaacaggatg ccctcctcag ttgggtcaag actgccagta cccccttggt 840 

gggcgagctg ggcccagaga cttactccgg atatataacg gtatgtcaca agacacaatc 900 

tcaatatcgc ttcacaacgt ttagtaaata atcatgagtt tctgacatgg gtttggttaa 960 

ggctggcatt ccactggcgt acattttcgc cgaaaccaaa gaagagcgtg agcagttcac 1020 

cgaggagttc aagttcatcg ccgagaaaca caagggttcc atcaatattg tcaccattga 1080 

cgccaagttg tacggcgctc atgcaggcaa tctcaacctt gacccctcca agttccctgc 1140 

attcgctatt caagaccctg aaaagaacgc caagtatcct tatgaccagt cgaaggaagt 1200 



11 



WO 00/70064 



PCT/DK00/00265 



caaggccaag gatatcggta aattcatcca agacgttctt gatgataaag tagagccaag 1260 
cattaagtct gaggctattc ctgagactca ggaaggtcct gttactgttg ttgtcgcgca 1320 
ttcctataag gatctcgtcc ttgacaacga gaaggacgtc cttctcgaat tttatgcgcc 1380 
atggtgcgga cactgcaagg cgtaagtttt ccccctcttt ctctacaacg aattatatcc 1440 
actctcgctt gcgaatacct aattaaacct tgaatagcct tgccccgaag tacgaggaac 1500 
ttgcaagcct ttacaaggat attcctgaag ttaccatcgc caaaattgac gcaacggcca 1560 
acgatgtccc cgactccatt acaggatttc ctactattaa gctcttcgct gccggcgcca 1620 
aggactcccc agttgaatat gaaggctctc gcacggtgga ggacctcgcc aacttcgtca 1680 
aggagaatgg caagcacaag gtcgatgctc ttgaagttga tccgaagaaa gaacaggaga 1740 
gtggcgatgc caccgagact cgggccgcct ctgacgagac cgaaactcct gctgctacta 1800 
gcgatgacaa gtctgagcat gatgaattgt aaatttcatt tggcctgata gtttgatcca 1860 
tatttatgtg aattcttgta ttctaccagc agtttgagca atcgcagcta cttccggctt 1920 
aggaaactgt tgttctatcc tagtgggaag ctt 1953 



<210> 13 
<211> 515 
<212> PRT 

<213> Aspergillus oryzae 
<400> 13 

Met Arg Thr Phe Ala Pro Trp lie Leu Ser Leu Leu Gly Ala Ser Ala 
! 5 10 15 

Val Ala Ser Ala Ala Asp Ala Thr Ala Glu Ala Pro Ser Asp Val Val 

20 25 30 

Ser Leu Thr Gly Asp Thr Phe Glu Thr Phe Val Lys Glu His Asp Leu 
35 40 45 

Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly His Cys Lys Ala Leu 
50 55 60 

Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu Lys Asn lie 
65 70 75 80 

Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu Cys Arg Asp 

85 90 95 

Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys lie Phe Arg Gly Leu Asp 

100 105 110 

Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala lie Val Ser 
115 120 125 

Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val Thr Pro Glu 
130 135 140 
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Asn Leu Glu Glu lie 
145 

lie Ala Ser Asp Asp 

165 

Glu Ser Gin Arg Asp 

180 

lie Ala Lys Ala Glu 
195 

Asp Phe Asp Glu Lys 
210 

Ala Leu Leu Ser Trp 
225 

Leu Gly Pro Glu Thr 

245 

Ala Tyr lie Phe Ala 

260 

Glu Phe Lys Phe lie 
275 

Thr lie Asp Ala Lys 
290 

Asp Pro Ser Lys Phe 
305 

Ala Lys Tyr Pro Tyr 

325 

Gly Lys Phe lie Gin 

340 

Lys Ser Glu Ala lie 
355 

Val Ala His Ser Tyr 
370 

Leu Leu Glu Phe Tyr 
385 



Lys Thr Met Asp Lys lie 
150 155 

Gin Thr Ala Asn Asp lie 

170 

Asn Tyr Leu Phe Ala Ala 

185 

Gly Val Lys Gin Pro Ser 
200 

Lys Ala Thr Tyr Asp Gly 
215 

Val Lys Thr Ala Ser Thr 
230 235 

Tyr Ser Gly Tyr lie Thr 

250 

Glu Thr Lys Glu Glu Arg 

265 

Ala Glu Lys His Lys Gly 
280 

Leu Tyr Gly Ala His Ala 
295 

Pro Ala Phe Ala lie Gin 
310 315 

Asp Gin Ser Lys Glu Val 

330 

Asp Val Leu Asp Asp Lys 

345 

Pro Glu Thr Gin Glu Gly 
360 

* 

Lys Asp Leu Val Leu Asp 
375 

Ala Pro Trp Cys Gly His 
390 395 



Val Val lie Gly Tyr 

160 

Phe Thr Thr Phe Ala 

175 

Thr Ser Asp Ala Ser 
190 

lie Val Leu Tyr Lys 
205 

Glu He Glu Gin Asp 
220 

Pro Leu Val Gly Glu 

240 

Ala Gly He Pro Leu 

255 

Glu Gin Phe Thr Glu 
270 

Ser He Asn He Val 
285 

Gly Asn Leu Asn Leu 
300 

Asp Pro Glu Lys Asn 

320 

Lys Ala Lys Asp He 

335 

Val Glu Pro Ser He 
350 

Pro Val Thr Val Val 
365 

Asn Glu Lys Asp Val 
380 

Cys Lys Ala Leu Ala 

400 
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Pro Lys Tyr Glu 



Thr He Ala Lys 

420 

Thr Gly Phe Pro 
435 

Pro Val Glu Tyr 
450 

Val Lys Glu Asn 
465 

Lys Lys Glu Gin 



Asp Glu Thr Glu 

500 

Asp Glu Leu 
515 



Glu Leu Ala Ser 
405 

He Asp Ala Thr 



Thr He Lys Leu 

440 

Glu Gly Ser Arg 
455 

Gly Lys His Lys 
470 

Glu Ser Gly Asp 
485 

Thr Pro Ala Ala 



Leu Tyr Lys Asp 
410 

Ala Asn Asp Val 
425 

Phe Ala Ala Gly 



Thr Val Glu Asp 

460 

Val Asp Ala Leu 
475 

Ala Thr Glu Thr 
490 

Thr Ser Asp Asp 
505 



He Pro Glu Val 
415 

Pro Asp Ser He 
430 

Ala Lys Asp Ser 
445 

Leu Ala Asn Phe 



Glu Val Asp Pro 

480 

Arg Ala Ala Ser 
495 

Lys Ser Glu His 
510 



<210> 14 
<211> 843 
<212> DNA 

<213> Aspergillus oryzae 
<400> 14 

atgcggactt tcgcaccttg gatcttgagc cttctagggg cttctgctgt agcttctgct 60 

gccgatgcga ctgccgaagc tccctccgat gtggtctcgc tcaccgggga cacattcgaa 120 

actttcgtca aggagcatga cctagttttg gccgagtttt ttgctccctg gtgtggccct 180 

tgcaaggctc tcgctccgaa atacgagcag gccgccactg agttaaagga aaagaacatt 240 

ccgctggtca aggttgattg caccgaggaa gaggctcttt gtagggacca aggtgttgaa 300 

ggttacccca cgctgaagat tttccgtggc cttgacgctg ttaagcctta tcagggagct 360 

cgtcagaccg aggcgattgt ttcatacatg gtcaagcagt cactacctgc tgtgtcccct 420 

gtcaccccag aaaacctcga agagatcaag actatggaca agattgtcgt tattggttat 480 

atcgcgtctg acgaccagac tgccaatgat atattcacca cttttgccga gtcacagaga 540 

gacaactacc tcttcgccgc cacaagtgat gcatcgatcg ctaaggcaga aggtgttaag 600 

caaccttcga ttgttctcta taaagacttc gatgaaaaga aagctactta tgatggagag 660 

attgaacagg atgccctcct cagttgggtc aagactgcca gtaccccctt ggtgggcgag 720 

ctgggcccag agacttactc cggatatata acggctggca ttccactggc gtacattttc 780 

gccgaaacca aagaagagcg tgagcagttc accgaggagt tcaagttcat cgccgagaaa 840 

cac 843 
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<210> 15 
<211> 281 
<212> PRT 

<213> Aspergillus oryzae 
<400> 15 

Met Arg Thr Phe Ala Pro Trp lie Leu Ser Leu Leu Gly Ala Ser Ala 
1 5 10 15 

Val Ala Ser Ala Ala Asp Ala Thr Ala Glu Ala Pro Ser Asp Val Val 

20 25 30 

Ser Leu Thr Gly Asp Thr Phe Glu Thr Phe Val Lys Glu His Asp Leu 
35 40 45 

Val Leu Ala Glu Phe Phe Ala Pro Trp Cys Gly Pro Cys Lys Ala Leu 
50 55 60 

Ala Pro Lys Tyr Glu Gin Ala Ala Thr Glu Leu Lys Glu Lys Asn He 
65 70 75 80 

Pro Leu Val Lys Val Asp Cys Thr Glu Glu Glu Ala Leu Cys Arg Asp 

85 90 95 

Gin Gly Val Glu Gly Tyr Pro Thr Leu Lys He Phe Arg Gly Leu Asp 

100 105 110 

Ala Val Lys Pro Tyr Gin Gly Ala Arg Gin Thr Glu Ala He Val Ser 
115 120 125 

Tyr Met Val Lys Gin Ser Leu Pro Ala Val Ser Pro Val Thr Pro Glu 
130 135 140 

Asn Leu Glu Glu He Lys Thr Met Asp Lys He Val Val He Gly Tyr 
145 150 155 160 

He Ala Ser Asp Asp Gin Thr Ala Asn Asp He Phe Thr Thr Phe Ala 

165 170 175 

Glu Ser Gin Arg Asp Asn Tyr Leu Phe Ala Ala Thr Ser Asp Ala Ser 

180 185 190 

He Ala Lys Ala Glu Gly Val Lys Gin Pro Ser He Val Leu Tyr Lys 
195 200 205 

Asp Phe Asp Glu Lys Lys Ala Thr Tyr Asp Gly Glu He Glu Gin Asp 
210 215 220 
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Ala Leu Leu Ser 
225 

Leu Gly Pro Glu 



Ala Tyr lie Phe 

260 

Glu Phe Lys Phe 
275 



Trp Val Lys Thr 
230 

Thr Tyr Ser Gly 
245 

Ala Glu Thr Lys 



lie Ala Glu Lys 

280 



Ala Ser Thr Pro 
235 

Tyr He Thr Ala 
250 

Glu Glu Arg Glu 
265 

His 



Leu Val Gly Glu 

240 

Gly He Pro Leu 
255 

Gin Phe Thr Glu 
270 



<210> 16 
<211> 285 
<212> DNA 

<213> Aspergillus oryzae 

gtggtctcgc tcaccgggga cacattcgaa 60 
gccgagtttt ttgctccctg gtgtggccat 120 
gccgccactg agttaaagga aaagaacatt 180 
gaggctcttt gtagggacca aggtgttgaa 240 
cttgacgctg ttaag 285 



<400> 16 

gccgatgcga 

actttcgtca 

tgcaaggctc 

ccgctggtca 

ggttacccca 



ctgccgaagc 
aggagcatga 
tcgctccgaa 
aggttgattg 
cgctgaagat 



tccctccgat 
cctagttttg 
atacgagcag 
caccgaggaa 
tttccgtggc 



<210> 17 
<211> 95 
<212> PRT 

<213> Aspergillus oryzae 
<400> 17 

Ala Asp Ala Thr Ala Glu Ala Pro Ser Asp Val Val Ser Leu Thr Gly 
15 10 15 

Asp Thr Phe Glu Thr Phe Val Lys Glu His Asp Leu Val Leu Ala Glu 

20 25 30 

Phe Phe Ala Pro Trp Cys Gly Pro Cys Lys Ala Leu Ala Pro Lys Tyr 
35 40 45 

Glu Gin Ala Ala Thr Glu Leu Lys Glu Lys Asn He Pro Leu Val Lys 
50 55 60 

Val Asp Cys Thr Glu Glu Glu Ala Leu Cys Arg Asp Gin Gly Val Glu 
65 70 75 80 
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Gly Tyr Pro Thr Leu Lys He Phe Arg Gly Leu Asp Ala Val Lys 

85 90 95 



<210> 18 
<211> 102 
<212> PRT 

<213> Aspergillus oryzae 
<400> 18 

Gly Pro Val Thr Val Val Val Ala His Ser Tyr Lys Asp Leu Val Leu 
15 10 15 

Asp Asn Glu Lys Asp Val Leu Leu Glu Phe Tyr Ala Pro Trp Cys Gly 

20 25 30 

His Cys Lys Ala Leu Ala Pro Lys Tyr Glu Glu Leu Ala Ser Leu Tyr 
35 40 45 

Lys Asp He Pro Glu Val Thr He Ala Lys He Asp Ala Thr Ala Asn 
50 55 60 

r 

Asp Val Pro Asp Ser He Thr Gly Phe Pro Thr He Lys Leu Phe Ala 
65 70 75 80 

Ala Gly Ala Lys Asp Ser Pro Val Glu Tyr Glu Gly Ser Arg Thr Val 

85 90 95 

Glu Asp Leu Ala Asn Phe 

100 



<210> 19 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 142779 
<400> 19 

ttgaattgaa aatagattga tttaaaactt c 31 



<210> 20 
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<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 142780 
<400> 20 

ttgcatgcgt aatcatggtc atagc 25 

<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 140288 



<210> 22 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 142778 
<400> 22 

aaatcaatct attttcaatt caattcatca tt 32 



<210> 23 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 160712 



<400> 21 



ttgaattcat gggtaataac tgatat 



26 



<400> 23 



cccttgcaag gctctcgctc c 



21 



<210> 24 



18 



WO 00/70064 



PCT/DKOO/00265 



<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : 18699 
<400> 24 

ttgccctcat ccccatcctt t 
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